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Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score 
Sequence : 

Scoring table 



Searched : 



March 4, 2004, 15:21:50 ; Search time 82.0638 Seconds 

(without alignments) 
1397.867 Million cell updates/sec 

US-09-668-314C-2_COPY_63_4 68 
2113 

1 ALEPALAS PAGAANFLAMVD STEDVASNCVPAQSLSEPIL 406 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



1586107 



Database : 



1586107 seqs, 282547505 residues 
Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

A_Geneseq__2 9 Jan 04 : * 

1 : geneseqpl9 8 0s : * 

2: geneseqpl990s : * 

3: geneseqp2 000s : * 

4: geneseqp2001s : * 

5 : geneseqp2002s : * 

6 : geneseqp2 003as : * 

7: geneseqp2003bs : * 

8: geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed 
and is derxved by analysis of the total score distribute 



on 



SUMMARIES 



% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


1 


2113 


100. 0 


413 


4 


AAE10658 


2 


2113 


100. 0 


413 


4 


AAE02610 


3 


2113 


100. 0 


413 


5 


ABB78619 


4 


2113 


100. 0 


475 


4 


AAE10657 


5 


2113 


100. 0 


475 


4 


AAE02609 


6 


2113 


100. 0 


475 


5 


ABB78618 


7 


2113 


100. 0 


481 


4 


AAG75592 


8 


2113 


100. 0 


518 


2 


AAW61362 


9 


2113 


100.0 


518 


2 


AAY13799 



Description 



Aael0658 
Aae02610 
Abb78619 
Aael0657 
Aae02609 
Abb78618 
Aag75592 
Aaw61362 
Aayl3799 



Acid-proc 
Human aci 
Asp-ldelt 
Secreted 
Human sec 
Secreted 
Human col 
Aspartic 
Human asp 



10 


2113 


100 


. 0 


518 


2 


AAY2 223 9 


11 


2113 


100 


. 0 


518 


2 


AAY4 1714 


12 


2113 


100 


. 0 


518 


3 


AAY8 842 4 


13 


2113 


100 


. 0 


518 


3 


AAB4 427 0 


14 


2113 


100 


. 0 


518 


4 


AAU072 0 1 


15 


2113 


100 


. 0 


518 


4 


AAE1062 8 


16 


2113 


100 


. 0 


518 


4 


AAE1 065 6 


17 


2113 


100 


. 0 


518 


4 


AAE0685 8 


18 


2113 


100 


. 0 


518 


4 

X 


AAE02 60 8 


19 


2113 


100 


. 0 


518 


4 


AAE0258 0 


20 


2113 


100, 


. 0 


518 


4 

X 


AAU2 90 5 9 


21 


2113 


100 . 


. 0 


518 


4 

x 


AAU0 6602 


22 


2113 


100 , 


. 0 


518 


5 


ABB06531 


23 


2113 


100 . 


. 0 


518 


5 


ABB7 8 58 9 


24 


2113 


100 . 


. 0 


518 


5 


ABB7861 7 


25 


2113 


100. 


. 0 


518 


5 


ABB074 53 


26 


2113 


100 , 


■ 0 


518 


6 


ABU5 8 4 3 5 


27 


2113 


100 . 


, 0 


518 


6 


ABU8 7 9 8 3 


28 


2113 


100 . 


0 


518 


6 


ABU842 98 


29 


2113 


100 . 


0 


518 


6 


ABR661 19 


30 


2113 


100 . 


0 


518 


6 




31 


2113 


100 . 


0 


518 


6 


ARTI9 9 5 OP 


32 


2113 


100 . 


0 


518 


6 


ABU 82 741 


33 


2113 


100 . 


0 


518 


6 


^ ^ » J U U J (J U 


34 


2113 


100 . 


0 


518 


6 


ARRfifi 111 


35 


2113 


100 . 


0 


518 


6 


ABU9 6T 64 


36 


2113 


100 . 


0 


518 

■w' —L \J 


6 


ABU 9 7 5 9 S 


37 


2113 


100. 


0 


518 


6 


ABO 0 8 67? 


38 


2113 


100. 


0 


518 


6 


ABO027P4 


39 


2113 


100. 


0 


518 


6 


ABR7 4 87 R 


40 


2113 


100 . 


o 


518 




ARRQ4 fidfi 


41 


2113 


100. 


0 


518 


6 


AB025216 


42 


2113 


100. 


0 


518 


6 


ABU85613 


43 


2113 


100. 


0 


518 


6 


ABU98773 


44 


2113 


100. 


0 


518 


6 


ABU97988 


45 


2113 


100. 


0 


518 


6 


ABU91694 



Aay22239 

Aay41714 

Aay88424 

Aab44270 

Aau07201 

Aael0628 

Aael0656 

Aae06858 

Aae02608 

Aae02580 

Aau2 905 9 

Aau06602 

Abb06531 

Abb78589 

Abb78617 

Abb07453 

Abu58435 

Abu87983 

Abu84298 

Abr66172 

Abr65562 

Abu99502 

Abu82741 

Abu89862 

Abr68111 

Abu96164 

Abu92595 

Abo08672 

Abo02724 

Abr74878 

Abr94640 

Abo25216 

Abu85613 

Abu98773 

Abu97988 

Abu91694 



Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Human 
Novel 
Human 
Human 
Human 
Human 
Human 
Novel 
Human 
Novel 
Human 
Human 
Human 
Human 
Human 
Novel 
Human 
Novel 
Novel 
Novel 



CSP 
PRO 
asp 
PRO 
asp 
asp 
-Asp 
asp 
Asp 
asp 
PRO 
Asp 
asp 
Asp 
Asp 
BAC 
PRO 
hum 
sec 
sec 
sec 
sec 
PRO 
hum 
sec 
hum 
sec 
sec 
sec 
sec 
sec 
hum 
PRO 
hum 
hum 
hum 



ALIGNMENTS 



RESULT 1 
AAE10658 

ID AAE10658 standard; protein; 413 AA. 
XX 

AC AAE10658; 
XX 

DT 10-DEC-2001 (first entry) 
XX 
DE 
XX 
KW 
KW 
KW 
XX 

OS Homo sapiens . 
OS Synthetic. 



Acid-processed hu-Asp 1 lacking TM domain and containing (His) 6 tag. 

Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP; 
Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis; 
amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective. 



XX 

PN GB2357767-A. 
XX 

PD 04-JUL-2001. 
XX 

PF 22-SEP-2000; 2000GB-00023315 . 
XX 

PR 23-SEP-1999; 99US-00404133 . 

PR 23-SEP-1999; 99US- 01554 93P . 

PR 23-SEP-1999; 9 9WO-US02 08 8 1 . 

PR 13-OCT-1999; 99US-004 16901 . 

PR 06-DEC-1999; 99US-0169232P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with amyloid 
precursor protein processing activity and alpha-secretase activity, for 
identifying modulators useful in treating Alzheimer's disease. 



PT 
PT 
XX 



PS Example 14; Page 158; 187pp; English. 
XX 

CC The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 

CC proteins which lack transmembrane domain or amino terminal domain or 

CC cytoplasmic domain and retains alpha-secretase activity and amyloid 

CC protein precursor (APP) processing activity. The proteins of the 

CC invention are useful for assaying hu-Aspl alpha-secretase activity, which 

CC in turn is useful for identifying modulators of hu-Aspl alpha-secretase 

CC activity, where modulators that increase hu-Aspl alpha-secretase activity 

CC are useful for treating Alzheimer's disease (AD) which causes progressive 

CC dementia with consequent formation of amyloid plaques, neurofibrillary 

CC tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 

CC for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 

CC with the substrate under acidic conditions and determining the level of 

CC hu-Aspl proteolytic activity. The present sequence is the acid- processed 

CC form of human Asp 1 protein lacking a transmembrane (TM) domain and 

CC containing (His) 6 tag 
XX 

SQ Sequence 413 AA; 



Query Match 100.0%; Score 2113; DB 4; Length 413; 

Best Local Similarity 100.0%; Pred. No. 1.5e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

QY 1 ALEPALAS PAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

I 1 I 1 I I I I I I M I I I I I I I I I I I i I M I I | | | | | I | |[ | I I I I I I I I M M I I I I I I I M 
Db 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 
I I i I i I I I I I I I I M I I M I I I I I I I I I i | I | | | I | | | | | | | | | | | | | | | | | | | M | | | | 
Db 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 



Qy 



121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 
I I I I I I I M M ! I I I I I I I I I I I I j I I I I | I | | | | | | | | | | | M I I t I I M I | I I | | I I I 



UD 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
N 1 f 1 1 1 1 1 1 f 1 1 1 1 1 1 1 ! 1 1 1 1 1 I 1 1 j I | | f | | | | f | | | | | | | | | | | j | | | | | | | | | | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA.lv 


240 




181 


240 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 

1 t 1 M 1 1 I 1 1 | I I M I ! 1 I I I I 1 1 1 I 1 I I I I ! f | | | | | | | | M | | | | | { | | | | j | f ,, | | 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 




241 


300 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
M 1 1 1 1 1 1 1 1 I I I I I I I I I | | | | | | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I I | 1 | ] | | | | | | | | | | j | | | | | | | | | | | 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


301 


360 


Qy 


361 


QKRVG FAAS P C AE I AGAAVS E I S G P FS T EDVASNCVPAQ S L S E P I L 4 06 

' 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 




Db 


361 





Human acid-processed form of aspartyl protease-1 deltaTM (His) 6 protein. 

Human; alpha-secretase ; amyloid precursor protein; APP; therapy; 
Alzheimer's disease; antialzheimer ' s ; aspartyl protease 1; Aspl; 
beta-secretase; acid-processed Asp-1 deltaTM (His) 6 protein. 



RESULT 2 
AAE02610 

ID AAE02610 standard; protein; 413 AA, 
XX 

AC AAE02 610; 
XX 

DT 10-AUG-2001 (first entry) 
XX 
DE 
XX 
KW 
KW 
KW 
XX 

OS Homo sapiens . 
OS Synthetic. 
XX 

PN WO200123533-A2 . 
XX 

PD 05-APR-2001. 
XX 

PF 22-SEP-2000; 2 000WO-US02 608 0 . 
XX 

PR 23-SEP-1999; 99US-01554 93P . 
PR 23-SEP-1999; 99WO-US 02 088 1 . 
PR 13-OCT-1999; 99US- 004 16901 . 
PR 0 6-DEC-19 99; 99US-01 69232 P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney M, Bienkowski MJ; 
XX 

DR WPI; 2001-290516/30. 
XX 
PT 
PT 
XX 

PS Example 14; Page 186-187; 189pp; English. 



Enzymes that cleave the alpha-secretase site of the amyloid precursor 
protein, useful for the treatment of Alzheimer's disease. 



XX 

CC The present invention relates to enzymes for cleaving the alpha- 
CC secretase site of the amyloid precursor protein (APP) and methods of 
CC identifying those enzymes. The methods may be used to identify enzymes 
that may be used to cleave the alpha-secretase cleavage site of the APP 
protein. The enzymes may be used to treat or modulate the progress of 
Alzheimer's disease. The present sequence is human acid-processed form of 
Aspartyl protease-1 (Asp-1) deltaTM (His) 6 protein which is used for the 
expression of pre-pro-human-Aspartyl protease 1 (Aspl). The protein is 
obtained by replacing C-terminal transmembrane and cytoplasmic domains 
with a hexahistidine purification tag in the human Aspartyl protease 1 



CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 

SQ Sequence 413 AA; 



Query Match 100.0%; Score 2113; DB 4; Length 413; 

Best Local Similarity 100.0%; Pred. No. 1.5e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Q v 1 ALE P ALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

' > I I I I I I | | | | | | | M I I I I I I ! I | | | | | | | | | | | ! [ f | | | | | | M | | ( | | | |, | ,,, , 
Db 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

61 HS YI DTYFDTERS ST YRSKGFDVTVKYTQGSWTGFVGEDLVT 1 PKGFNTS FLVNI AT I FE 120 

MM I I I I I I I I I I II I I I I II I I I I I I M | | | | | | | | | m I I I I I I I I I I M II 

Db 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

Q y 121 SENFFLPGIKWNGILGIAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

I ' I I I I I I II I I I I II I I I I I I I I M I I I I I I I I I I | | | | | | | | | M I I M I I I I I | | | I 
Db 121 SENFFLPGI KWNG1 LGLAYATLAKPS S SLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 180 

Q y 181 CTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 

''''I' I I I I I I I I I I I I M I I I I I I I I | | | | | | | | | | | | M | | | | | | | | | | | | | 

Db 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

Q y 241 DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKI SI YL 300 

I I I ' I I ' I I I I I I I I I I I I I f I I I I | | I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 241 DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNS ETPWS YFPKI S I YL 300 

Q y 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 

I I I I ' I I I ' I M I I I I II I I I I I I f | | | | | I I II I I f I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 

Qy 36 1 QKRVGFAAS P CAE I AGAAVS E I S G P FS T EDVASNCVPAQ S LS EP I L 406 

I ' M I I E I I I I I I I I I I I | | M I I I I I I I I I I I | | | | | | | | | I I I I 
Db 361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 



RESULT 3 
ABB78619 

ID ABB78619 standard; protein; 413 AA. 
XX 

AC ABB78619; 
XX 

DT 16-JUL-2002 {first entry) 
XX 

DE Asp-ldeltaTM (his) 6 acid-processed form SEQ ID NO: 68 
XX 



KW Human; Asp-1; Asp-2; aspartyl protease; Alzheimer's disease; proteolytic; 

KW chromosome 21. 

XX 

OS Homo sapiens . 
XX 

PN GB2367060-A. 
XX 

PD 27-MAR-2 0 02. 
XX 

PF 29-OCT-2001; 2001GB-00025934 . 
XX 

PR 23-SEP-1999; 9 9US-004 04 133 . 

PR 23-SEP-1999; 99US-0 1554 93P . 

PR 23-SEP-1999; 99WO-US020881 . 

PR 13-OCT-1999; 9 9US-004 1 6901 . 

PR 06-DEC-1999; 99US-0169232P . 

PR 22-SEP-2000; 2 000GB- 0 0 023315 . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2002-397167/43. 
XX 

PT Human aspartyl protease 1 substrates useful in assays to detect aspartyl 

PT protease activity, e.g. for the diagnosis of Alzheimer's disease. 

XX 

PS 
XX 



Example 14; Page 158-159; 182pp; English. 



CC The present invention describes a human aspartyl protease 1 (hu-Aspl) 

CC substrate (I) which comprises a peptide of no more than 50 amino acids, 

CC and which comprises the 8 amino acid sequence Gly-Leu-Ala-Leu-Ala-Leu- 

CC Glu-Pro. Also described are: (1) a method (II) for assaying hu-Aspl 

CC proteolytic activity, comprising: (a) contacting a hu-Aspl protein with 

CC (I) under acidic conditions; and (b) determining the level of hu-Aspl 

CC proteolytic activity; (2) a purified polynucleotide (III) comprising a 

CC nucleotide sequence that hybridises under stringent conditions to the non 

CC -coding strand complementary to a defined 18 04 nucleotide sequence (see 

CC ABL52456) where the nucleotide sequence encodes a polypeptide having Aspl 

CC proteolytic activity and lacks nucleotides encoding a transmembrane 

CC domain); (3) a purified polynucleotide (III*) comprising a sequence that 

CC hybridises under stringent conditions to (III) (the nucleotide sequence 

CC encodes a polypeptide further lacking a pro-peptide domain corresponding 

CC to amino acids 23-62 of hu-Aspl (see ABB78589)); (4) a vector (IV) 

CC comprising (III) or (III'); and (5) a host cell (V) transformed or 

CC transfected with (III), (III') and/or (IV). The hu-Aspl protease 

CC substrate (I) may be used as an enzyme substrate in assays to detect 

CC aspartyl protease activity, (II) and therefore diagnose diseases 

CC associated with aberrant hu-Aspl expression and activity such as 

CC Alzheimer's disease. Hu-Aspl has been localised to chromosome 21, while 

CC hu-Asp2 has been localised to chromosome llq23 . 3-24 . 1 . The present 

CC sequence represents the Asp-ldeltaTM ( his ) 6 acid-processed form amino acid 

CC sequence, which is used in an example from the present invention 

XX 

SQ Sequence 413 AA; 



Query Match 



100.0%; Score 2113; DB 5; Length 413; 



Best Local Similarity 100.0%; Pred. No. 1.5e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

QY 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

' I I I I I I I I I II I M I I | | | | | | | | | | I I I I I M | | | | | | | | | | | | | | || | I I I I | | | | | 
Db 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

Q y 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

N I I I I I I I I I I I I M I I I I I I I I I I I I M I If I I I I I I I I | | | | I I I I f I I I I I I | | I I 
Db 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 



Q y 121 SENFFLPGIKWNGILGIjAYATLAJ^PSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

N I I I I I I M I I I I M I I I I I I I I I I I I I f | | | | | | I I I I I I I I I I I I I M I I I II I I I I 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 



Db 121 



Q y 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

I I I M I I II I I I I I I I I I I II I I I I I I I I I I I I I M I I I I M I I I I I I I I I II I II I I I I 
Db 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 



^ 241 DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI YL 300 

>> I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I | | I I I I I I I I | | | | | | M I I 

DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI YL 300 



Db 241 



Q y 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVI FDRA 360 

N I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I | | | | | | | | | | | | I I I 

Db 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 



QY 3 ^1 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

M I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | 
Q KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS NC VP AQ S L S E P I L 4 06 



Db 361 



RESULT 4 
AAE10657 

ID AAE10657 standard; protein; 475 AA. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 



AAE10657; 

10-DEC-2001 (first entry) 

Secreted recombinant hu-Asp 1 with (His) 6 tag and lacking TM domain. 

Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP; 
Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis; 
amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective. 

Homo sapiens . 
Synthetic . 

GB2357767-A 

04-JUL-2001 

22-SEP-2000 



23-SEP-1999. 
23-SEP-1999; 



2000GB-00023315. 

99US-00404133 . 
99US-0155493P. 



PR 23-SEP-1999; 99WO-US02 08 8 1 . 
PR 13-OCT-1999; 99US-0 04 16901 . 
PR 06-DEC-1999; 99US- 0169232P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with amyloid 
PT precursor protein processing activity and alpha-secretase activity, for 
PT identifying modulators useful in treating Alzheimer's disease 
XX 

PS Example 14; Page 156-158; 187pp; English. 
XX 

CC The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 
CC proteins which lack transmembrane domain or amino terminal domain or 
CC cytoplasmic domain and retains alpha-secretase activity and amyloid 
CC protein precursor (APP) processing activity. The proteins of the 
CC invention are useful for assaying hu-Aspl alpha-secretase activity, which 
in turn is useful for identifying modulators of hu-Aspl alpha-secretase 
activity, where modulators that increase hu-Aspl alpha-secretase activity 
are useful for treating Alzheimer's disease (AD) which causes progressive 
dementia with consequent formation of amyloid plaques, neurofibrillary 
tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 
for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 
with the substrate under acidic conditions and determining the level of 
hu-Aspl proteolytic activity. The present sequence is secreted 
recombinant human Asp 1 protein lacking a transmembrane (TM) domain and 
containing a (His) 6 tag. This sequence is generated from human Asp 1 
protein by the deletion of its C-terminal TM domain and addition of hexa- 
CC histidine tag at its C-terminus 
XX 

SQ Sequence 475 AA; 



CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



Query Match 100.0%; Score 2113; DB 4; Length 475; 

Best Local Similarity 100.0%; Pred. No. 1.9e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALAS PAGAANFIiAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | 1 1 1 1 1 I 1 1 1 I I I I 1 1 1 1 1 1 II 1 1 1 1 1 1 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 > 1 1 M 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 M | | | | | | | | | | | | | | | | | | t | f | j | | | | ,, | , | , | , 

HS YIDTYFDTERS ST YRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI FE 


120 


Db 


123 


182 


Qy 


121 


S ENFFLPGI KWNGI LGLAYATLAKPS S SLETFFDS LVTQANI PNVFSMQMCGAGLPVAGS 

' 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I I I | 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | M 1 1 1 II 1 1 II 1 1 1 1 1 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 I 1 f 1 1 1 1 1 1 1 1 1 | | 1 1 1 1 1 1 1 I 1 1 I I 1 1 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 



Qy 



241 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 300 



Db 


303 






Db 


363 


Qy 


361 


Db 


423 



> f I I I M I I I I I ! I I I I I I ! I I ! | | | | I | | | | ! | | | f | | | | | | | | | | | | | | f M | | , | | , 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 362 
RDENS S RS FRI TI LPQLYI Q PMMGAGLNYEC YRFGI S PS TNALVI GATVMEGFYVI FDRA 360 

I ' M I I I I ! I I I I It I 1 I I I I ! I | | I | | | | | | | | I | | | | | | i m | I I I I I I M I f | | | | | 

RDENS SRS FRI TI L PQLYI QPMMGAGLNYECYRFGI S P STNAL VI GATVMEGFYVI FDRA 422 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
I I I I I I I I I I I M I I I I I I I I I | | | | | | M | | | | | M I I I I I I I | | 
QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 



Human; alpha-secretase; amyloid precursor protein; APP; therapy; 
Alzheimer's disease; antialzheimer ' s ; aspartyl protease 1; Aspl; 
beta-secretase; secreted Asp-1 deltaTM (His) 6 protein. 



RESULT 5 
AAE02609 

ID AAE02609 standard; protein; 475 AA. 
XX 

AC AAE02609; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human secreted aspartyl protease-1 (Asp-1) deltaTM (His) 6 protein. 
XX 

KW 

KW 

KW 

XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200123533-A2 . 
XX 

PD 05-APR-2001 . 
XX 

PF 22-SEP-2000; 2 000WO-US02608 0 . 
XX 

PR 23-SEP-1999; 99US-0155493P . 

PR 23-SEP-1999; 99WO-US020881 . 

PR 13-OCT-1999; 99US-00416901 . 

PR 06-DEC-1999; 99US-0169232P . 
XX 

PA { PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney M, Bienkowski MJ; 
XX 

DR WPI; 2001-290516/30. 



Enzymes that cleave the alpha-secretase site of the amyloid precursor 
protein, useful for the treatment of Alzheimer's disease. 



XX 
PT 
PT 
XX 

PS Example 14; Page 184-186; 189pp; English. 
XX 

CC The present invention relates to enzymes for cleaving the alpha- 

CC secretase site of the amyloid precursor protein (APP) and methods of 

CC identifying those enzymes. The methods may be used to identify enzymes 

CC that may be used to cleave the alpha-secretase cleavage site of the APP 

CC protein. The enzymes may be used to treat or modulate the progress of 

CC Alzheimer's disease. The present sequence is human secreted recombinant 



CC Aspartyl protease-1 (Asp-1) deltaTM (His) 6 protein which is used for the 

CC expression of pre-pro-human-Aspartyl protease 1 (Aspl). The protein is 

CC obtained by replacing C-terminal transmembrane and cytoplasmic domains 

CC with a hexahistidine purification tag in the human Aspartyl protease 1 
XX 

SQ Sequence 475 AA; 

Query Match 100.0%; Score 2113; DB 4; Length 475; 

Best Local Similarity 100.0%; Pred. No. 1.9e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

' N 1 1 1 ! 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 ! 1 I 1 ! I I I I I I | | f | | | | | | | | m | f | | | | | | | | | 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGT P 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
N M M 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | M 1 1 1 1 1 1 I f I | | ! | | | | | | | | | | 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 I I I I 

j. i. xjc u± xwvix vjx j_ivjj^Lrt. ±±/-\j\r o j_i£Li let JJo j_i v I y/\N X FN V FSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 N 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | m 1 1 1 1 1 1 1 1 1 1 | | | | | || 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 M 1 ! M 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | 1 II 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 II 1 1 1 i 1 1 1 
DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDEN S S RS FRI T I LPQLYIQPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FDRA 

N f 1 1 1 1 ! 1 f ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 I 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 
1 1 1 M M 1 1 1 1 1 1 1 I I I I | | | | | | | | | | M 1 1 1 1 1 1 I I I | | | | | | | 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 




Db 


423 





RESULT 6 
ABB78618 

ID ABB78618 standard; protein; 475 AA. 
XX 

AC ABB78618; 
XX 

DT 16-JUL-2002 (first entry) 
XX 

DE Secreted recombinant Asp-ldeltaTM (his ) 6 amino acid SEQ ID NO- 67 
XX 

KW Human; Asp-1; Asp-2; aspartyl protease; Alzheimer's disease; proteolytic; 

KW chromosome 21. 

XX 

OS Homo sapiens . 
XX 

PN GB2367060-A. 
XX 



PD 27-MAR-2002. 
XX 

PF 29-OCT-2001; 2001GB-00025934 . 
XX 

PR 23-SEP-1999; 99US- 004 04 133 . 

PR 23-SEP-1999; 99US-01554 93P . 

PR 23-SEP-1999; 99WO-US02088 1 . 

PR 13-OCT-1999; 9 9US-004 16901 . 

PR 06-DEC-1999; 9 9US-01 692 32P . 

PR 22-SEP-2000; 2000GB- 00023315 . 
XX 

PA ( PHAA ) PHARMACIA & UPJOHN CO 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2002-397167/43. 
XX 



PT Human aspartyl protease 1 substrates useful in assays to detect aspartyl 
protease activity, e.g. for the diagnosis of Alzheimer's disease. 



PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 



CC 
CC 
CC 
CC 



CC 
CC 
CC 



CC 
CC 
CC 



Example 14; Page 156-158; 182pp; English. 



The present invention describes a human aspartyl protease 1 (hu-Aspl) 
substrate (I) which comprises a peptide of no more than 50 amino acids, 
and which comprises the 8 amino acid sequence Gly-Leu-Ala-Leu-Ala-Leu- 
Glu-Pro. Also described are: (1) a method (II) for assaying hu-Aspl 
proteolytic activity, comprising: (a) contacting a hu-Aspl protein with 
CC (I) under acidic conditions; and (b) determining the level of hu-Aspl 
proteolytic activity; (2) a purified polynucleotide (III) comprising a 
nucleotide sequence that hybridises under stringent conditions to the non 
-coding strand complementary to a defined 1804 nucleotide sequence (see 
ABL52456) where the nucleotide sequence encodes a polypeptide having Aspl 
CC proteolytic activity and lacks nucleotides encoding a transmembrane 
CC domain); (3) a purified polynucleotide (III f ) comprising a sequence that 
CC hybridises under stringent conditions to (III) (the nucleotide sequence 
encodes a polypeptide further lacking a pro-peptide domain corresponding 
to amino acids 23-62 of hu-Aspl (see ABB78589)); (4) a vector (IV) 
comprising (III) or (III'); and (5) a host cell (V) transformed or 
CC transfected with (III), (III') and/or (IV). The hu-Aspl protease 
CC substrate (I) may be used as an enzyme substrate in assays to detect 
CC aspartyl protease activity, (II) and therefore diagnose diseases 
CC associated with aberrant hu-Aspl expression and activity such as 

Alzheimer's disease. Hu-Aspl has been localised to chromosome 21, while 
hu-Asp2 has been localised to chromosome llq23 . 3-24 . 1 . The present 
sequence represents a secreted recombinant Asp-ldeltaTM (his ) 6 amino acid 
CC sequence, which is used in an example from the present invention 
XX 

SQ Sequence 475 AA; 



Query Match 100.0%; Score 2113; DB 5; Length 475; 

Best Local Similarity 100.0%; Pred. No. 1.9e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

1 1 ' I ! I I I I I I I I I I ! t I I I I I I I I | | | | | | | | | | | | | | | | | M | | f | | | | ( | , f | ,, | [ 
Db 63 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 



2V 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

N M I I I I M ! I I I I I I I I M I I I I I I M I I I M i I I I I II I I I I I I I I I I I I I i M I I I 

Db 123 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 

Q y 121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLWQANIPNVFSMQMCGAGLPVAGS 180 

>> I II I I I I I I I | | | | | | | | ! | | | | | I I I I I I I I | | | | | | | | | | | | I I I I I I I I I I I I I I 
Db 183 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 242 

Q y 181 GTNG GSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 

N I I I I I I I I I I I I I I I I I I I | | | | | | | | I I I I I I I I I I M I I I I I I I I I I | I | I 

Db 243 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 302 

Qy 241 DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 300 

> I I I I M I I I I I I I I I I I I I I I I I | f M I I I I II I I I I f I I I I I I I I I I I I I | | | f | | | | 

Db 303 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 362 



Q y 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 

I I I I M II II I II | I | | M || | | | | | | | M I M I I I I I I I I | | | | | | | | | | | | | | 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 422 



Db 363 



Qy 361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

i ' N I I I I I I I I I | | | | | | | I I I I I I I I I I I I I I I I I I I I 

Db 423 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 



03-SEP-2001 (first entry) 

Human colon cancer antigen protein SEQ ID NO: 6356 



RESULT 7 
AAG75592 

ID AAG75592 standard; protein; 481 AA. 
XX 

AC AAG75592; 
XX 
DT 
XX 
DE 
XX 

KW Human; colon cancer; colon cancer antigen; diagnosis; detection; 

KW colorectal carcinoma; chromosome 21. 

XX 

OS Homo sapiens. 
XX 

PN WO200122920-A2. 
XX 

PD 05-APR-2001. 
XX 

PF 28-SEP-2000; 2000WO-US026524 . 
XX 

PR 29-SEP-1999; 99US-0157137P . 
PR 03-NOV-1999; 99US-0163280P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Ruben SM, Barash SC, Birse CE, Rosen CA; 
XX 

DR WPI; 2001-235357/24. 
DR N-PSDB; AAH34997. 
XX 

PT Nucleic acids encoding 4277 human colon cancer-associated polypeptides, 



cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 



PT useful for preventing, diagnosing and/or treating colorectal cancers. 
XX 

PS Claim 11; Page 7815-7817; 9803pp; English. 
XX 

CC AAH32943 to AAH37195 and AAG73514 to AAG77788 represent human colon 

cancer-associated nucleic acid molecules (N) and proteins (P), where the 
proteins are collectively known as colon cancer antigens. The colon 
cancer antigens have cytostatic activity and can be used in gene therapy 
and vaccine production. N and P may be used in the prevention, diagnosis 
and treatment of diseases associated with inappropriate P expression For 
example, N and P may be used to treat disorders associated with decreased 
expression by rectifying mutations or deletions in a patient's genome 
that affect the activity of P by expressing inactive proteins or to 
CC supplement the patients own production of P. Additionally, N may be used 
to produce the colon cancer-associated Ps, by inserting the nucleic acids 
into a host cell and culturing the cell to express the proteins. N and P 
can be used in the prevention, diagnosis and treatment of colorectal 
carcinomas and cancers. AAH37196 to AAH37204 and AAB77789 represent 
sequences used in the exemplification of the present invention N B 
Pages 666 to 682 and page 7053 of the sequence listing were missing at 
time of publication, meaning no sequences are present for SEO ID NO-10?7 
CC to 1052, 7921 and 7922 
XX 

SQ Sequence 481 AA; 

Query Match 100.0%; Score 2113; DB 4; Length 481; 

Best Local Similarity 100.0%; Pred. No. 1.9e-198; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

Q ¥ 1 -^LEPALAS PAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 26 ALE PALAS PAGAANFLAMVDNLQGDS GRGY YLEMLI GT P PQKLQ I LVDTGS SN FAVAGT P 85 

Q y 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFE 120 

1 1 1 1 1 ' I I I M I I I I I I I I I | | M I I I I I I I I I I M ! I I I I I I I I I I f I | | | | If 

Db 86 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 145 

Qy 121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

, Ar N I M I I I | | | | | | | | | | | | | | | | M I I I I I I | | | | | M I I I I I I I I I | | | | | M 

Db 146 SENFFLPGIKWNGILGIxA.YATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 205 

QY 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQ1EILKLEIGGQSLNLDCREYNADKAIV 240 

1 1 1 1 1 1 I ' I I I I I I I I I I I I I I I I t | | | | | | | | | | | | I | | | | | | | || 

Db 206 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 265 

Q y 241 DSGTTLLRLPQKVFDAVVELAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKI SI YL 300 

_ I I I I I I I I I M I f I I I I I M I I I I I I I I I I I I I I I I I | | | | | I | | | | I I I I I I I 

Db 266 DSGTTLLRLPQKVFDAVVEIAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 325 

Q y 301 PDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | , | | | | | | | | | || 

Db 326 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 385 

®y 361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

f ' I I I 1 I I I I I I I M I I I I I I I I I | | | | | | | | | | | I f I I I I I I I | | 
Db 386 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 431 



RESULT 8 
AAW61362 

ID AAW61362 standard; protein; 518 AA. 
XX 

AC AAW613 62; 
XX 

DT 25-MAR-2 003 (revised) 

DT 25-SEP-1998 (first entry) 

XX 

DE Aspartic proteinase ASP1. 
XX 

KW ASP1; Aspartic proteinase; Alzheimer's disease; cancer; melanoma. 
XX 

OS Homo sapiens. 
XX 

PN EP848062-A2. 
XX 

PD 17-JUN-1998. 
XX 

PF 01-DEC-1997; 97EP-0030964 8 . 
XX 

PR 14-DEC-1996; 96GB-00 02 6022 . 

PR 06-OCT-1997; 97US-00999723 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Powell DJ, Southan C, Chapman CG, Evans JR; 
XX 

DR WPI; 1998-314477/28. 

DR N-PSDB; AAV27962. 
XX 

PT New isolated polynucleotide encodes Aspartic protease polypeptide - used 

PT to diagnosis, treat and vaccinate against Alzheimer's disease, cancer and 

PT melanoma . 
XX 

Claim 11; Page 7; 19pp; English. 



PS 
XX 
CC 

cc 



The human ASP1 protein is structurally related to other proteins of the 
Aspartic proteinase family. ASP 1 polypeptides and polynucleotides can be 
CC used to diagnosis, treat and vaccinate against Alzheimer's disease, 
CC cancer and melanoma. (Updated on 25-MAR-2 003 to correct PR field ) 
XX ' 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2113; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRG YYLEMLI GTPPQKLQI LVDTGS SNFAVAGTP 60 

1 1 1 1 1 I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | | | | , | 
Db 63 ALE PALAS PAGAAN FLAMVDNLQGDS GRG YYLEML I GTP PQKLQ I LVDTGS SNFAVAGT P 122 

Q y 61 HS YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGBT^ITSFLVNIATI FE 120 

I I I I I I I I I I I I I I | | I | | | | | | | | | M I I f I I I I I I | | | f | | M I f| I I I I I I I 

Db 123 HSYIDTYFDTERSSTYRSKGFDVT VKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 



Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 I M 1 1 1 M 1 I I 1 1 I M 1 I 1 1 1 1 1 1 | 1 | | | | | | | | | I | | | | | | | | j | | | | | f | ,, | |, 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 I 1 1 1 1 1 I I I 1 1 1 1 I 1 1 1 f 1 I 1 II 1 ! 1 1 1 1 1 1 1 1 ! 1 I 1 I f | | I | | | | | | | | f | | | M t | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

< I 1 1 M 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 I I M M 1 1 1 II 1 1 1 1 1 1 1 I 1 M 

DSGTTLLRLPQWFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPOLYIOPMGAGLNYFrYRFRT^P^TMaTVTrTiTiTMrrirvT t~t r-r^* 

-1--UJ. ^-u± j.yj.111 ±vjj-vvj jjih _l ± r\c Ol OiO J. 1M r\ 1 i v X Lj/\x V JMiliCjir x VI FDRA. 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 1 1 1 N 1 1 1 1 1 I I I I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 
QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 




Db 


423 





21-SEP-1999 (first entry) 
Human aspartyl protease, CSP56. 



RESULT 9 
AAY13799 

ID AAY13799 standard; protein; 518 AA. 
XX 

AC AAY13799; 
XX 
DT 
XX 
DE 
XX 

KW CSP56; human; aspartyl protease; diagnosis; neoplasia; tumour; 

KW breast tumour; colon tumour. 

XX 

OS Homo sapiens . 
XX 

PN W09933963-A1. 
XX 

PD 08- JUL-1999 . 
XX 

PF 14-DEC-1998; 98WO-US 02 654 7 . 
XX 

PR 31-DEC-1997; 97US-0070112P . 
XX 

PA (CHIR ) CHIRON CORP. 
XX 

PI Giese KW, Xin H; 
XX 

DR WPI; 1999-430240/36. 
DR N-PSDB; AAX89297. 
XX 
PT 
XX 

PS Claim 2; Fig 2A; 51pp; English. 
XX 
CC 



Human CSP56 protein for diagnosis of neoplasia. 



This represents a human CSP56 protein, a novel aspartyl protease. The 



CC CSP56 protein can be used in methods for diagnosing neoplasia, for 

CC determining the metastatic potential of a tumour, and for screening test 

CC compounds for the ability to suppress the metastatic potential of a 

CC tumour. The tumours are preferably from breast or colon 

XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2113; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0- 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 1 > 1 1 1 1 1 1 1 1 1 1 ! 1 M 1 f M 1 M 1 1 1 1 1 I I I I M | 1 I 1 1 I I | | | M | | | | | | 

ALE PALAS PAGAAN FLAMVDNLQGDS GRG YYLEMLI GT P PQKLQ I LVDTGS SN FAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
1 1 1 1 " 1 1 1 I i 1 1 M 1 1 1 II 1 1 1 1 1 1 M | | | | | | | | | | | | | | | | | | | | | | , | | |, 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


S ENFFL P GI KWNGI LGLAYATLAKP S S S LET FFDS LVTQAN I PNVFSMQMCGAGLP VAGS 
1 1 ' 1 1 1 1 I 1 1 1 1 1 1 f 1 I I I I | | | | | | | | | | | | | | | | | | | | | | | | j ,,,,,,,,,,,,,, , 
S ENFFLPGI KWNGI LGLAYATLAK P S S ST.PTFFn^T wnaMT o^n/iPcwwr-'^Tv /~t m r^^o 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 ' 1 ' 1 I I 1 ! 1 1 1 1 1 1 1 1 1 1 I | I | | | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1! 1 1 1 1 1 1 1 1 1 I 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 > 1 'M 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f | | | | | | 1 I 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M f 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENS S RS FRIT I LPQLYI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FDRA 

1 1 " 1 I M 1 1 1 1 1 | 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 I I I | | 1 1 1 | | || | | | M 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

> 1 1 1 N 1 II II M 1 1 1 I I I | | | | | II 1 1 II 1 1 1 1 1 1 I 1 1 I 1 1 1 1 II 
QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





RESULT 10 
AAY22239 

ID AAY22239 standard; protein; 518 AA 
XX 

AC AAY22239; 
XX 
DT 
XX 
DE 
XX 
KW 
KW 



20-SEP-1999 (first entry) 

Human CSP56, aspartyl-type protease, protein sequence. 



Metastatic marker protein; human; cancer metastasis; breast cancer; 
colon cancer; diagnosis; therapy; tumour; metastatic potential; CSP56; 
KW aspartyl-type protease. 
XX 

OS Homo sapiens. 
XX 

PN WO9934004-A2 . 



XX 

PD 08-JUL-1999. 
XX 

PF 24-DEC-1998; 98WO-US027 608 . 
XX 

PR 31-DEC-1997; 97US-007 01 12P . 
XX 

PA (CHIR ) CHIRON CORP. 
XX 

PI Xin H , Giese K; 
XX 

DR WPI; 1999-430248/36. 
DR N-PSDB; AAX84708. 
XX 

PT New polynucleotides associated with cancer metastasi, 
XX 

PS Claim 4; Page 78-80; 80pp; English. 
XX 



cc 
cc 
cc 
cc 



This sequence represents a polypeptide of the invention, and is an 
aspartyl-type protease, designated CSP56 . The polynucleotides (PNs) of 
the invention encode metastatic marker protein variants. The PNs and 
polypeptides can be used as markers for cancer metastasis. The products 
CC can be used for identifying metastatic tissue or metastatic potential of 
CC a tissue, e.g. breast or colon tissue. They can also be used for 

screening test compounds for the ability to suppress the metastatic 
potential of a tumour. The products can be used for developing products 
for the therapy of cancers, particularly breast or colon cancer 



CC 
CC 
CC 
XX 

SQ Sequence 518 AA; 



Query Match 100.0%; Score 2113; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGT P 60 

„ I M I I I I I I II II I I || I | M I I I I I I I M I I I I | | | | | | | | | | | | | | | | || || | 

Db 63 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

Q y 61 HSYIDTYFD TERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | , | | | | 

Db 123 HSYIDTY FDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 18 0 

1 M 1 ' I I I I I I I I I I I I I M I I I I I I M I I I M I I I I I I I I I I | | | M I I I I I I I I I I I I 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 2 42 

Qy 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

1 1 1 1 1 1 > M I I I I I ! I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | I I I I I I I I I I I I I 
Db 243 G TNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 302 

Q y 241 DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 300 

Db w 1111111 LI 11 " Ml 11 " I 'I MINI UN M I II II Ml II II I I II I I III MM 

Dt> 303 DSGTTLLRLPQKVFDAVVE1AVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 362 

Qy 301 ^DENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVIFDRA 360 

1 1 f I M II I II If II I I II I I I I I I II I II II I I I I I I I I || M I I I I I I | | Ml 

Db 363 RDENS S RS FRI TI LPQLYIQPMMGAGLNYEC YRFGI S PSTNALVI GATVMEGFYVI FDRA 422 



Qy 121 
Db 183 



Qy 361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

N I I I I I t I I M I I I I I I I I I i I I | | | | | | | | | | | | ( | | | | | | | | | 
Db 423 QKRVG FAAS P CAEI AGAAVS E I S G P FS T ED VASNCVPAQ S L S E P I L 468 



07-DEC-1999 



RESULT 11 
AAY41714 

ID AAY41714 standard; protein; 518 AA 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
XX 
PN 
XX 



AAY41714; 

(first entry) 
Human PR0852 protein sequence. 



Human; PRO; EST; expressed sequence tag; PCR primer; hybridisation- 
probe; blood coagulation disorder; cancer; cellular adhesion disorder; 
secreted protein; transmembrane protein. 

Homo sapiens. 
W09946281-A2 . 



PD 


16 


-SEP 


-1999 






XX 












PF 


08 


-MAR 


-1999; 


_ j if* \y 




XX 












PR 


10 


-MAR- 


-1998; 


98US 


-0077450P. 


PR 


11 


-MAR- 


-1998; 


98US 


-0077632P. 


PR 


11 


-MAR- 


-1998; 


98US- 


-0077641P. 


PR 


11 


-MAR- 


-1998; 


98US- 


-0077649P. 


PR 


12 


-MAR- 


-1998; 


98US- 


-0077791P. 


PR 


13 


-MAR- 


-1998; 


98US- 


-0078004P. 


PR 


17 


-MAR- 


-1998; 


98US- 


-00040220. 


PR 


20- 


-MAR- 


-1998; 


98US- 


-0078886P . 


PR 


20- 


-MAR- 


-1998; 


98US- 


-0078910P. 


PR 


20- 


-MAR- 


-1998; 


98US- 


-0078936P. 


PR 


20- 


-MAR- 


-1998; 


98US- 


-0078939P. 


PR 


25- 


-MAR- 


-1998; 


98US- 


-0079294P . 


PR 


26- 


-MAR- 


-1998; 


98US- 


-0079656P. 


PR 


27- 


-MAR- 


-1998; 


98US- 


-0079663P. 


PR 


27- 


-MAR- 


-1998; 


98US- 


-0079664P. 


PR 


27- 


-MAR- 


-1998; 


98US- 


-0079689P. 


PR 


27- 


-MAR- 


1998; 


98US- 


■0079728P. 


PR 


27- 


-MAR- 


1998; 


98US- 


-0079786P. 


PR 


30- 


-MAR- 


1998; 


98US- 


0079920P. 


PR 


30- 


-MAR- 


1998; 


98US- 


0079923P. 


PR 


31- 


-MAR- 


1998; 


98US- 


0080105P. 


PR 


31- 


-MAR- 


1998; 


98US- 


0080107P. 


PR 


31- 


-MAR- 


1998; 


98US- 


0080165P. 


PR 


31- 


-MAR- 


1998; 


98US- 


0080194P. 


PR 


01- 


-APR- 


1998; 


98US- 


0080327P. 


PR 


01- 


APR- 


1998; 


98US- 


0080328P. 


PR 


01- 


APR- 


1998; 


98US- 


0080333P. 


PR 


01- 


APR- 


1998; 


98US- 


0080334P. 


PR 


08- 


APR- 


1998; 


98US- 


0081049P. 



PR 08-APR-1998; 98US-0 08107 OP . 

PR 08-APR-1998; 98US-0 08107 IP . 

PR 09-APR-1998; 98US-0 081195P . 

PR 09-APR-1998; 98US-0 0812 03P . 

PR 09-APR-1998; 98US-0081229P . 

PR 15-APR-1998; 98US-0081817P . 

PR 15-APR-1998; 98US-0081838P . 

PR 15-APR-1998; 98US-008 1 952P . 

PR 15-APR-1998; 98US- 008 1955P . 

PR 21-APR-1998; 98US-0082568P . 

PR 21-APR-1998; 98US-0082569P . 

PR 22-APR-1998; 98US-0082 700P . 

PR 22-APR-1998; 98US- 0082 704 P . 

PR 22-APR-1998; 98US-0082804P . 

PR 23-APR-1998; 98US- 00827 67P . 

PR 23-APR-1998; 98US-00827 96P . 

PR 27-APR-1998; 98US-0083336P . 

PR 28-APR-1998; 98US-0083322P . 

PR 29-APR-1998; 98US-0083392P . 

PR 29-APR-1998; 98US-00 834 95P . 

PR 29-APR-1998; 98US- 008 34 96P . 

PR 29-APR-1998; 98US-008 34 99P . 

PR 29-APR-1998; 9 8US-008 3500P . 

PR 29-APR-1998; 9 8US-008 354 5P . 

PR 29-APR-1998; 98US-0083554P . 

PR 29-APR-1998; 98US-0083558P . 

PR 29-APR-1998; 98US-008355 9P . 

PR 30-APR-1998; 98US-008 3742P . 

PR 05-MAY-1998; 98US-0084366P . 

PR 06-MAY-1998; 98US-0084414P . 

PR 06-MAY-1998; 98US-00844 4 IP . 

PR 07-MAY-1998; 98US- 00 84598P . 

PR 07-MAY-1998; 98US-0084600P . 

PR 07-MAY-1998; 98US-0084627P . 

PR 07-MAY-1998; 98US-0084637P . 

PR 07-MAY-1998; 98US-0084639P . 

PR 07-MAY-1998; 98US-0084640P . 

PR 07-MAY-1998; 98US-0084 643P . 

PR 13-MAY-1998; 98US-0085323P . 

PR 13-MAY-1998; 98US-0085338P . 

PR 13-MAY-1998; 98US-0085339P . 

PR 15-MAY-1998; 98US-0085573P . 

PR 15-MAY-1998; 98US-0085579P . 

PR 15-MAY-1998; 98US-0085580P . 

PR 15-MAY-1998; 98US-0085582P . 

PR 15-MAY-1998; 98US-008 568 9P . 

PR 15-MAY-1998; 98US-0085697P . 

PR 15-MAY-1998; 98US- 0085700P . 

PR 15-MAY-1998; 98US-0085704P . 

PR 18-MAY-1998; 98US-008 6023P . 

PR 22-MAY-1998; 98US-0086392P . 

PR 22-MAY-1998; 98US-00864 14P . 

PR 22-MAY-1998; 98US-0086430P . 

PR 22-MAY-1998; 98US-008 6486P . 

PR 28-MAY-1998; 9 8US-0 08 7 098P . 

PR 28-MAY-1998; 98US-0087 106P . 

PR 28-MAY-1998; 98US-0087208P . 



PR 30-JUL-1998; 98US-0094 651P . 

PR ll-SEP-1998; 98US-0100038P . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Wood WI, Goddard A, Gurney A, Yuan J, Baker KP, Chen J; 
XX 

DR WPI; 1999-551358/46. 

DR N-PSDB; AAZ34056. 
XX 

PT New secreted and transmembrane polypeptides and their polynucleotides 

PT useful for treating blood coagulation disorders, cancers and cellular' 

PT adhesion disorders. 
XX 

Claim 12; Fig 73; 530pp; English. 



PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



The present invention describes secreted and transmembrane polypeptides 
and their polynucleotides. The nucleotide sequences are useful as sources 
of probes, primers, for chromosome mapping, and for generation of 
antisense sequences. They can also be used to create transgenic animals. 
The proteins can be used to treat a variety of diseases and disorders 
depending on their function. Diseases that may be treated include blood 
coagulation disorders, cancers and cellular adhesion disorders. They may 
also be used to raise antibodies. AAZ33891 to AAZ34338, and AAY41685 to 
AAY41774 represent polynucleotide and polypeptide sequence given in the 
CC exemplification of the present invention 
XX 

SQ Sequence 518 AA; 



Query Match 100.0%; Score 2113; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

' 1 1 1 1 1 1 1 1 I I I | | | | | | | | M II 1 1 ! 1 1 1 1 I I I | | | f | M 1 1 1 1 1 I I I | M I 1 I 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HS YI DTYFDTERS ST YRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNI ATI FE 

1 M 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | 1 1 1 1 1 1 | M 1 1 1 I I I I | | | | | II 1 II 1 1 1 1 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 " 11 1 I 1 1 1 1 1 1 1 1 1 I I ! | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
" 1 1 ' 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 | | | | | | | | | | | M | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I I M 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 1 1 1 1 | | | | M 1 1 1 1 1 1 1 1 1 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSS RS FRI TI L PQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FDRA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I ! I I II 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I I I | | | | | II 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 



Q y 361 QKRVGFAAS P C AE I AGAAVS E I S G P FS T EDVAS NCVPAQ S L S E P I L 4 06 

I I I I I I I M I I I f I I I I I I I I I I I I I I I I I I I I I | ( | | | | | | j | M 
Db 423 QKRVGFAAS PC7VE I AGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 



RESULT 12 
AAY88424 

ID AAY88424 standard; protein; 518 AA. 
XX 

AC AAY88424; 
XX 

DT 03-AUG-2000 (first entry) 
XX 

DE Human aspartyl protease 1 (Aspl) amino acid sequence 
XX 

KW Aspartyl protease; aspartase; amyloid precursor protein; APP; Asp 1- 

KW Alzheimer's disease; beta secretase site. 

XX 

OS Homo sapiens. 
XX 

PN WO200017369-A2 . 
XX 

PD 30-MAR-2000. 
XX 

PF 23-SEP-1999; 99WO-US02 08 8 1 . 
XX 

PR 24-SEP-1998; 98US-0101594P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney ME, Bienkowski MJ, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2000-303209/26. 
DR N-PSDB; AAA15661. 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 

cc 

CC 
CC 
CC 



CC 
CC 



New enzyme designated human aspartase useful in research into Alzheimer's 
Disease is capable of cleaving amyloid protein precursor at the beta 
secretase site to produce amyloid beta peptide. 

Claim 54; Fig 1; 183pp; English. 



This sequence represents the human aspartyl protease amino acid sequence. 
The invention relates to a protease capable of cleaving the beta 
secretase site of amyloid precursor protein (APP ) . The protease contains 
a sequence encoding the amino acid sequence DTG and a sequence encoding 
DSG or DTG separated by 100-300 amino acids. When mutated the APP gene 
CC causes an autosomal dominant form of Alzheimer's disease. APP localises 
CC to the cell surface membrane and have a single C-terminal transmembrane 
domain. Proteolytic processing of APP produces the amyloid beta protein 
which is possibly very important in Alzheimer's disease. The invention 
CC includes a nucleotide sequence encoding the protease, a vector containing 
CC the nucleotide sequence, and a cell line comprising the vector. Methods 

for screening for inhibitors of beta secretase activity are also given in 
the invention. The human aspartase protein and nucleotide sequences and 
the methods for identifying inhibitors of the protease, are useful in the 
treatment of and research in to Alzheimer's disease 



CC 
CC 
CC 
XX 



SQ 



Sequence 518 AA; 



Query Match 100.0%; Score 2113; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
1 1 1 1 1 > f I 1 1 1 1 1 1 ! 1 1 1 I I f j | | | | | | | | | f I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I I | | | 

ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


DO 


63 


122 


Qy 


61 


HSYTDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
' 1 ' 1 " 1 1 1 1 1 1 1 f 1 1 1 1 1 ! | | j | | | | | | | | | | | | | f | | | | | | | | ,| ,,,,,,,,,,,, , 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

' 1 1 1 > 1 1 1 1 1 1 1 1 1 f 1 1 I 1 f| 1 M 1 1 M 1 1 1 1 M 1 1 1 1 f 1 ! 1 I I M M 1 1 I 1 1 1 I I ! | f | 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 1 1 1 > 1 1 II f 1 1 1 1 1! 1 1 1 1 1 f I 1 I I I I | | | | | | | | f| 1 1 1 1 I 1 1 1 j I I M | 1 1 1 1 1 1 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 M 1 1 1 M 1 1 1 1 1 1 
DSGTTLLRLPQKVFDAVVT1AVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 
1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 I I 1 1 1 1 1 1 | | | | | | | | 1 1 M 1 1 1 1 1 1 1 I I I I | | | | 
RDENSS RS FRI T I LPQLYIQPMMGAGLN YEC YRFGI S P STNALVI GAT VMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 N 1 1 1 1 1 1 1 II 1 | | | | | | | | | | | | M 1 1 1 1 1 1 1 1 I | | | | | | | | | | 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 




Db 


423 





RESULT 13 




AAB44270 




ID 


AAB44270 standard; protein; 518 AA. 




XX 






AC 


AAB44270; 




XX 






DT 


08-FEB-2001 (first entry) 




XX 




DE 


Human PR0852 (UNQ418) protein sequence 


SEQ ID NO: 196. 


XX 


KW 


Human; secreted protein; transmembrane 


protein; PRO; 


KW 


expressed sequence tag; detection; cancer. 


XX 






OS 


Homo sapiens. 




XX 






PN 


WO200053756-A2. 




XX 






PD 


14-SEP-2000. 




XX 






PF 


18-FEB-2000; 2 000WO-US004 34 1 . 




XX 






PR 


08-MAR-1999; 9 9WO-US00502 8 . 





PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

XX 

PA 

XX 

PI 

PI 

PI 

PI 

PI 

XX 

DR 

DR 

XX 

PT 

PT 

PT 
XX 

PS 
XX 
CC 

cc 

CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



12 -MAR 
2 9 -MAR- 
21-APR- 
2 8 -APR- 
14-MAY- 
2 3-JUN- 

2 6- JUL- 

29- OCT- 

3 0-NOV- 
02-DEC- 
02-DEC- 
16-DEC- 

30- DEC- 
30-DEC- 

05- JAN- 

06- JAN- 
06- JAN- 



-1999 
-1999 
■1999 
■1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
1999 
2000 
2000 
2000 



99US 
99US 
99US 
99US- 
99US- 
99US- 
99US- 
99US- 
99WO- 
99WO- 
99WO- 
99WO- 
9 9WO- 
9 9WO- 
2 00 0WO- 
2 00 0WO- 
2 000WO- 



-0123957P. 
-0126773P, 
-0130232P. 
-0131445P. 
-0134287P. 
-0141037P. 
-0145698P. 
-0162506P. 
-US028313. 
-US028551. 
-US028565. 
-US030095. 
-US031243. 
-US031274 . 
-US000219. 
-US000277 . 
US000376. 



(GETH ) GENENTECH INC. 

Ashkenazi AJ, Baker KP, Botstein D, Desnoyers L, Eaton DL • 

Ferrara N, Filvaroff E, Fong S, Gao W, Gerber H, Gerritsen ME * 

Gocidard A, Godowski PJ, Grimaldi CJ, Gurney AL, Hillan K J * 

Kljavin IJ, Kuo SS, Napier MA, Pan J, Paoni NF, Roy MA, 'shelton DL; 

Stewart TA, Tumas D, Williams PM, Wood WI ; 

WPI; 2000-611443/58. 
N-PSDB; AAC78500. 

Novel PRO polypeptides and polynucleotides used in detection methods to 
target bioactive molecules to specific cells, and to modulate cellular 
activities . 

Claim 12; Fig 73; 636pp; English. 

AAC78458 to AAC78599 represent polynucleotide and EST (expressed sequence 
tag) sequences which encode secreted or transmembrane PRO polypeptides 
The PRO polynucleotides and polypeptides have cytostatic activity. The 
polynucleotides and polypeptides can be used for detecting the presence 
of PRO polypeptides in samples, for linking bioactive molecules to cells 
and for modulating biological activities of cells, using the polypeptides 
for specific targeting. The polypeptide targeting can be used to kill the 
target cells, e.g. for the treatment of cancers. The polypeptide pairs 
provide specific targeting of bioactive molecules to cells. AAC78600 to 
AAC78987 represent PCR primers and probes used in the isolation of the 
PRO polynucleotide sequences 

Sequence 518 AA; 



Query Match 100.0%; Score 2113; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

Qy 1 ^LEPALAS P^ Gj ^^FLAMVI)NLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

„ I f I M I I I I I I I I I I I I I M I I I I M I I I i i M i i i i M , i i i i i i i , , ,, , 

D]D 63 ^IjEPALAS PAGAANFIAMVDNLQGDSGRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGT P 122 



C>\7 


& i 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
> >ll II II II 1 MINI Mill INIIIM M || Mill ||,|| ,,,,,,,,,,,,,,,, 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVN1AT1FE 


120 


Db 


100 
1Z j 


182 


wy 




SENFFLPGIKWNGILGIAYATIAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 f 1 1 1 1 1 > 1 1 I 1 i 1 f 1 1 1 1 1 1 1 I 1 1 1 1 1 f 1 1 1 f 1 f | | | | | | | | | M 1 1 M 1 1 II II II 1 1 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


LJkJ 


loo 


242 




1 O I 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 1 11 1 ' 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 II 1 1 1 1 1 I I | 1 1 1 1 1 I M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


LJVJ 




302 


wy 




DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 ' 11 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | | M 1 1 1 1 1 1 I I | | | | M M 1 II 1 1 1 II 1 1 1 

DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKI SI YL 


300 


iJJD 


o n 0 


362 


<_y 


301 


RDENSS RS FRI TI LPQL YI QPMMGAGLN YEC YRFGI S PS TNALVI GATVMEGFYVI FDRA 

1 1 I 1 1 1 1 1 1 1 1 | | | | | | | | || | | | | | | | I || | | 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 , 1 1 . | , 

11 ' 1 1 1 1 1 1 ' 1 1 1 ' 1 * 1 t 1 1 1 1 1 I I I f I I | | { | | | | | | [ | | | | | | | 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAAS PCAE I AGAAVS E I S G P FS T ED VAS NCVPAQ S L S E P I L 4 06 

' N 1 M 1 1 1 1 I I I I | | | M | | | | | | | | | | | | | j | j | 1 , | | | || | | , 

QKRVGFAAS PCAEIAGAAVS EI SGPFSTEDVASNCVPAQSLSEP I L 468 




Db 


423 





RESULT 14 
AAU07201 

ID AAU07201 standard; protein; 518 AA. 
XX 

AC AAU07201; 
XX 
DT 
XX 
DE 
XX 
KW 
KW 



24-OCT-2001 (first entry) 

Human aspartyl protease 1 (Asp-1) 



Human; aspartyl protease 1; Asp-1; nootropic; neuroprotective; 
aspartyl protease 2; Asp2 ; amyloid protein precursor; APP; 

KW beta-secretase; Alzheimer's disease. 
XX 

OS Homo sapiens. 
XX 

PN WO200149097-A2. 
XX 

PD 12-JUL-2001 . 
XX 

PF 09-MAY-2001; 2 001WO-IB0 007 97 . 
XX 

PR 09-MAY-2001; 200 1WO-IB0 007 97 . 
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 

XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 



DR WPI; 2001-502548/55 
DR N-PSDB; AAS11701. 



XX 
PT 
PT 
PT 



Novel purified polypeptide comprising fragment of mammalian aspartvl 
protease 2, lacking Asp2 transmembrane domain and retaining beta 
PT activity aCt±Vity ° f Asp2 useful for identifying inhibitors of As P 2 
XX 

PS Example 2; Fig 1; 185pp; English. 
XX 



CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



The invention relates to a novel purified polypeptide comprising a 
fragment of mammalian aspartvl protease 2 (Asp2) protein which lacks the 
Asp2 transmembrane domain and the As P 2 protein, and where the polypeptide 
orletn f " gmeilt f e * a * n . the beta-secretase activity of the mammallL Asp2 
protein. Also included is an isoform of amyloid protein precursor (APP) 

APp P c^eavaae sitf ^ *ZVT* ^ ^ AP? °" ±tS f «9ment containing an 

APP cleavage site recognisable by a mammalian beta-secretase, and further 
comprising two lysine residues at the carboxyl terminus of the amino acid 
sequence of the mammalian APP or APP fragment. The polypeptides are used 
for assaying for modulators of beta-secretase activity; identifying 
agents that inhibit the APP processing activity of human Asp2 aspartvl 
protease (Hu-Asp2); identifying agents that modulate the activity of Asp2 
; and for reducing cellular production of amyloid beta (Abeta) from APP 
Agents identified by the above methods are useful for treating 



rr ^ h ^^ er,s ^ d i sease ^ and for identifying modulators of amyloid-beta 
CC (Abeta) peptide production, for use in designing therapeutics for the 



CC 
CC 



*f!f . m ! n l° r P revent f on of Alzheimer's disease. Probes and primers 



derived from Asp nucleic acid sequences are useful for detecting Hu-Asp 
CC nucleic acids m in vitro assays and in Northern and Southern blots. The 
CC present sequence represents the amino acid sequence of human Asp-1 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198- 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

QY 1 ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 
n , I I I I I I I I I I I I I I I I I I I | | | || | | | M I I I I I I I | | | | | | | | | | | | || | | | | I 

Db 63 ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

QY 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

1 I I I I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 123 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 

QY 121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

loo 1 1 1 I I I I I I I II I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 183 SENFFLPGIKWNGILGLAYATI^PSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 



QY 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 

nK I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | | | | | | | || | 

Db 243 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 3 02 

Qy 241 DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 300 

n . „ ' 1 1 1 1 11 1 1 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I | | | | | | | | | | | | | | | | | 

Db 303 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 362 



QY 301 R DENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 3 

I I I I I I I I I | | I | | | | | | | | | | | | | | , | | | | | , , , I I I I I I I | | I I I I I 

Db 3 63 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 4 

Qy 361 QKRVG FAAS P CAE I AGAAVS EI S GP FS T EDVAS NCVPAQ S L S E P I L 406 

1 I I I I I I I I I I | | | | M I I I I I I I I | | | | | | | | | | | | | t | | 

Db 423 QKRVG FAAS P CAE I AGAAVS E I S GP FS T EDVAS NCVPAQ S L S E P I L 4 68 



RESULT 15 
AAE10628 
ID 
XX 
AC 
XX 



DT 

XX 

DE 

XX 

KW 

KW 

KW 

KW 

XX 

OS 

XX 

FH 

FT 

FT 

FT 

FT 

FT 

FT 

XX 

PN 

XX 
PD 

XX 

PF 

XX 

PR 

PR 

PR 

PR 

PR 

XX 

PA 

XX 

PI 

XX 

DR 

DR 

XX 

PT 

PT 

PT 
XX 



AAE10628 standard; protein; 518 AA. 
AAE10628; 
10-DEC-2001 (first entry) 

Human aspartyl protease 1 (hu-Aspl) protein. 

Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP • 
Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis* 
amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective- 
chromosome 21. ' 



Homo sapiens 
Key 

Peptide 
Protein 
Domain 



Location/Qualifiers 
1 . .20 

/label= Signal_peptide 
21. .518 

/note= "Mature human aspartyl protease 1" 
469. .492 

/label= Transmembrane domain 



GB2357767-A. 
04- JUL-2001 . 

22-SEP-2000; 2 00 0GB-00023315 



23-SEP-1999; 
23-SEP-1999; 
23-SEP-1999; 
13-OCT-1999; 
06-DEC-1999; 



99US-00404133. 
99US-0155493P. 
99WO-US020881 . 
99US-00416901. 
99US-0169232P. 



{ PHAA ) PHARMACIA & UPJOHN CO. 

Bienkowkski MJ, Gurney M; 

WPI; 2001-444208/48. 
N-PSDB; AAD17864. 

Polypeptide comprising fragments of human aspartyl protease with amyloid 
precursor protein processing activity and alpha-secretase activity, for 
identifying modulators useful in treating Alzheimer's disease 



PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



Claim 36; Fig 1; 187pp; English. 



The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 
proteins which lack transmembrane domain or amino terminal domain or 
cytoplasmic domain and retains alpha-secretase activity and amyloid 
protein precursor (APP) processing activity. The proteins of the 
invention are useful for assaying hu-Aspl alpha-secretase activity, which 
xn turn is useful for identifying modulators of hu-Aspl alpha-secretase 
activity where modulators that increase hu-Aspl alpha-secretase activity 
are usef ul _ f or treating Alzheimer's disease (AD) which causes progressive 
dementia with consequent formation of amyloid plaques, neurofibrillary 
tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 
for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 
with the substrate under acidic conditions and determining the level of 
CC hu-Aspl proteolytic activity. The present sequence is Aspl protein from 
t-f numan. Aspl gene is localised on chromosome 21 



CC 
XX 

SQ Sequence 518 AA; 



Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-198; 

Matches 406; Conservative 0; Mismatches 0; ' Indels 0- Gaps 



0; 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGTP 

1 1 11 1 1 ' ' M 1 1 1 1 I I I | | | | | | M I 1 1 I I I | | M f M 1 1 I I I | | M 1 1 1 1 1 it 1 1 1 f 1 I 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HS YI DT YFDTERS STYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FE 
1 ' 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 II 1 II 1 1 1 1 1 1 1 1 | | M 1 1 1 | II 1 1 1 1 I I I | | | | | | | | | | | M 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 1 1 11 1 " 1 ' 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
" 1 1 1 ' 1 1 ' 1 1 II 1 1 I I | | | | | | | | | | | | | | | | | , | | | , , | , , | ,, , | , | M , 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 1 1 M ' 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 | | | | | | 1 I 1 1 1 1 1 1 | | | | | | | | | | 1 1 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENS S RS FRITI LPQL YIQPMMGAGLNYEC YRFGI S PSTNALVI GATVMEGFYVI FDRA 

1 1 1 " ' ' 1 1 1 1 1 1 1 1 1 I I I | | | | | j 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I I M 1 1 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNAL VI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVG FAAS P CAE I AGAAVS EISGPFSTE D VASNCVPAQ S L S E P I L 4 06 

1 " 1 1 1 1 1 1 1 1 1 1 1 1 I I M 1 1 1 1 I I I | | | | | | | | 1 1 

QKRVGFAAS PCAEI AGAAVS EI SGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





Search completed: March 4, 2004, 15:35:43 
Job time : 83.0638 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



March 4, 2004, 15:31:20 



Search time 26.3468 Seconds 

(without alignments) 

795.548 Million cell updates/sec 



Title : 

Perfect score 
Sequence : 

Scoring table 



Searched : 



US-09-668-314C-2_COPY_63_4 68 
2113 

1 ALE PALAS PAGAAN FLAMVD STEDVASNCVPAQSLSEPIL 406 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



389414 seqs, 51625971 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



389414 



Post-processing 



Minimum Match 0% 
Maximum Match 10 0% 
Listing first 45 summaries 



Database : Issued_Patents AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/6A_COMB.pep : * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5 : / cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep : * 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2113 


100. 


0 


518 


3 


US- 


08-99 


2 


2113 


100. 


0 


518 


3 


US- 


09-43 


3 


2113 


100 . 


0 


518 


4 


US- 


09-54 


4 


2113 


100. 


0 


518 


4 


US- 


09-54 


5 


2113 


100. 


0 


518 


4 


us- 


09-55 


6 


2113 


100. 


0 


518 


4 


US- 


09-21 


7 


1962 


92. 


9 


514 


3 


US- 


09-71 


8 


1962 


92 . 


9 


514 


4 


US- 


09-91 


9 


1133 


53. 


6 


501 


4 


US- 


09-71 


10 


1133 


53. 


6 


501 


4 


US- 


09-54 


11 


1133 


53 . 


6 


501 


4 


us- 


09-54 



9- 


-723-2 


Sequence 


2, 


Appli 


4- 


427-2 


Sequence 


2, 


Appli 


8- 


372D-2 


Sequence 


2, 


Appli 


8- 


367D-2 


Sequence 


2, 


Appli 


1- 


853D-2 


Sequence 


2, 


Appli 


5- 


450-19 


Sequence 


19, 


Appl 


7- 


432-2 


Sequence 


2, 


Appli 


2- 


484-2 


Sequence 


2, 


Appli 


3- 


158-2 


Sequence 


2, 


Appli 


8- 


372D-8 


Sequence 


8, 


Appli 


8- 


367D-8 


Sequence 


8, 


Appli 



1 7 


1133 

-1_ -U -—J O 


S3 fi 
j j » u 


s n i 

J U _L 


1 3 


1133 


5 3 fi 


s m 

J U X 


1 4 


1 1 3P 


53 fi 


4 07 


1 s 


1 1 3P 


5 3 fi 


4 31 

fi O JL 


1 6 


1 13P 


S3 fi 

JO • u 


4 S9 
ft j <c 


1 7 


1 13P 

J- J- sj 


S3 6 


4 S 3 




1 1 3P 

J_ _L -^V ^ 


S3 fi 


4 S 3 

ft J o 


1 9 


1132 


S3 

J J » Vj 


4 S 3 
*± j o 


po 


1 1 3P 


S3 fi 


4 S fi 

fi D 


7 1 


1 1 3P 


S3 fi 


4 S Q 

fx O " 


77 


1 1 3P 


S3 fi 


4 S Q 

fi O 


p "3 


1 1 3P 


S3 fi 


4 S Q 
ft j 


P4 


1 1 3P 


S3 fi 
j o • o 


4 r n 

ft o u 


P s 


1 1 3P 

J_ J_ O <£l 


S3 fi 


zl ft ft 
fi O O 


P 6 


1 1 3P 


S3 fi 


s n i 


P 7 


1 1 3P 

J L J Z. 


S3 fi 


s n 1 


P ft 


1 1 3P 


S3 fi 


S fi 1 


P 9 


1 1 3P 


S3 fi 
JO . D 


s n i 


30 


1 1 3P 


S3 fi 
O J . D 


s n 3 

jUj 


3 1 


1 1 Pft 


S3 4 
JO • 1 




3P 


1 1 P ft 
1 ± il u 


S3 4 


/4 /3 
fi ft fi 


3 3 


1 1 P7 


S3 3 


3 3 
ft O O 


•~j " 


1 1 P7 


S3 3 
j O . O 


A 3 3 
fl o J 


3 5 


1 1 97 
j. j. ^. / 


S3 3 


j5 3 3 
fl JO 


3 fi 


1 1 71 


S3 3 


fi fi O 


37 


1 1 97 


S3 3 

J O . O 


zl /I 
fi fi O 


3 ft 


1 1 97 


^3 3 


4 /4 ^ 
fi fi O 


- — ' J 


1 1 97 


S3 3 


/ICQ 

fi D z? 


fi u 


Hi / 


C o o 
jj . o 


4 5 9 


41 


1127 


53. 3 


459 


42 


1126 


53.3 


501 


43 


1123 


53. 1 


425 


44 


1123 


53 . 1 


425 


45 


1123 


53. 1 


425 



4 US- 09- 55 1-8 5 3D- 8 

4 US-09-724-5 66A-65 

4 US-0 9-72 4-56 6A-5 8 

4 US-09-72 4-5 6 6A-74 

4 US-0 9-724-5 66A-5 9 

4 US-09-54 8-372D-30 

4 US-0 9-54 8-3 67D-30 

4 US- 09-55 1-8 53D- 30 

4 US-0 9-724-5 66A-4 3 

4 US-0 9-54 8-372D-32 

4 US-09-54 8-3 67D-32 

4 US-0 9-551-853D-32 

4 US-09-724-5 66A-66 

4 US-09-604-608-2 

4 US- 0 9-54 8-3 7 2D- 4 

4 US-0 9-54 8-367D-4 

4 US-09-551-853D-4 

4 US-0 9-724-566A-2 

4 US-09-604-608-3 

4 US- 0 9-724-5 66A- 68 

4 US-09-724-566A-67 

4 US-0 9-54 8-3 72D-2 6 

4 US-09-54 8-367D-26 

4 US- 0 9-551-8 53D-2 6 

4 US-09-54 8-372D-22 

4 US-0 9-54 8-3 67D-22 

4 US-09-551-8 5 3D-22 

4 US-0 9-54 8-372D-24 

4 US-09-54 8-367D-24 

4 US-0 9-551-853D-24 

4 US-09-009-191-2 

4 US-0 9-54 8-3 72D-2 8 

4 US-09-54 8-367D-2 8 

4 US-0 9-551-8 53D-2 8 



Sequence 8, Appli 
Sequence 65, Appl 
Sequence 58, Appl 
Sequence 74, Appl 
Sequence 59, Appl 
Sequence 30, Appl 
Sequence 30, Appl 
Sequence 30, Appl 
Sequence 43, Appl 
Sequence 32 , Appl 
Sequence 32, Appl 
Sequence 32, Appl 
Sequence 66, Appl 
Sequence 2, Appli 
Sequence 4, Appli 
Sequence 4, Appli 
Sequence 4, Appli 
Sequence 2, Appli 
Sequence 3, Appli 
Sequence 68, Appl 
Sequence 67 , Appl 
Sequence 2 6 , Appl 
Sequence 26, Appl 
Sequence 2 6, Appl 
Sequence 22, Appl 
Sequence 22 , Appl 
Sequence 22 , Appl 
Sequence 24, Appl 
Sequence 24, Appl 
Sequence 24, Appl 
Sequence 2, Appli 
Sequence 28, Appl 
Sequence 28, Appl 
Sequence 28, Appl 



ALIGNMENTS 



RESULT 1 
US-08-999-723-2 

; Sequence 2, Application US/08999723A 
; Patent No. 6025180 
; GENERAL INFORMATION : 

APPLICANT: Powell, David J. 

APPLIC7ANT: Southan, Christopher 
; APPLICANT: Chapman, Conrad G. 

APPLICANT: Evans, Joanne R. 

TITLE OF INVENTION: ASPl 

FILE REFERENCE: GH70262 

CURRENT APPLICATION NUMBER: US/ 08/999, 72 3A 
; CURRENT FILING DATE: 1997-10-06 
NUMBER OF SEQ ID NOS : 2 
SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 



ORGANISM: Homo sapiens 
US-08-99-9-723-2 

Query Match 100.0%; Score 2113; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

1 ALEPAIASPAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 
I I ' I II I I I I I I I I f I I I I | M I I I I ! I I I II I I I I I I I I I ! I | I | I | | | | | | | | | | | | f 
63 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

i I I I I I I I I I II I I I I I | | | | | | | | | | I I I I I If I 

123 HS YI DTYFDTERS ST YRS KGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIAT I FE 182 

121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 180 

I N M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I 
183 SENFFLPGIPCWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 

181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I | | | | | | | | | 
2 43 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA.lv 302 

241 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 300 

I I M I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | 

303 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 362 

301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 

' N I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I 
363 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 422 

361 Q KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I L 4 06 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
423 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 



RESULT 2 
US-09-434-427-2 

; Sequence 2, Application US/09434427 
; Patent No. 6162630 
; GENERAL INFORMATION: 

APPLICANT: POWELL, DAVID J. 

APPLICANT: SOUTHAN, CHRISTOPHER 
; APPLICANT: CHAPMAN, CONRAD G. 
; APPLICANT: EVANS , JOANNE R. 

TITLE OF INVENTION: ASP1 

FILE REFERENCE: GH-70262-D1 

CURRENT APPLICATION NUMBER: US/09/434,427 

CURRENT FILING DATE: 1999-11-04 

EARLIER APPLICATION NUMBER: US 08/999,723 

EARLIER FILING DATE: 1997-10-06 

EARLIER APPLICATION NUMBER: UK 9626022.9 

EARLIER FILING DATE: 1996-12-14 

NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 518 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-434-427-2 



Query Match 100.0%; Score 2113; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 1 M 1 I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I | | | I | | | | | | | | | | | | | | | || | | | , | 

ALEPALAS PAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
' N 1 1 1 1 1 1 1 1 I I 1 I 1 1 1 I 1 1 1 I 1 f 1 1 1 1 1 1 | | f | | | | | | | | | | | | | | | | | | | | , | , | | | 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


S ENFFLPGI KWNGI LGLAYATLAKPS S SLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I I I I I I | | i | | | M | | | | | | | | | || | | | | | | | | | , | , | , | 

SENFFLPGIfOTNGI LGLAYATLAKPS S SLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I I I I i i i i i i i i i i i i t i t i i i i i t i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 If II 1 1 1 1 1 1 1 1 1 M M 1 1 1 M 1 I 1 1 1 1 1 1 1 1 1 II 1 I I | | | | | 1 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

N 1 1 t 1 1 I 1 f I 1 M M 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 M 1 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENS S RS FRITI LPQLYIQPMMGAGLNYEC YRFGI S PSTNALVI GATVMEGFYVI FDRA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I 

RDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I I I I I | | | | | | | | | | | | | | | | | | | | | | | | 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





RESULT 3 

US-09-548-372D-2 

; Sequence 2, Application US/09548372D 
; Patent No. 6420534 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/62801 

CURRENT APPLICATION NUMBER: US/09/548 , 372D 

CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US9 9/2 0 8 8 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 



NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-372D-2 

Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | M 1 1 1 1 1 ! 1 1 1 f 1 1 M 1 1 M 1 ! I I I | | M | | | i | | | | | 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

M I 1 1 f f I f 1 I 1 1 I 1 M 1 1 1 1 1 1 I I M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | f | | 1 1 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 ! I i I i | i i r i i i i i i i i i i i i i i t i i i t i i iiiiiiii 

1 1 1 1 1 1 1 1 1 1 1 1 1 ■ > ■ M t M 1 1 M 1 1 1 1 1 1 1 | | 1 ! 1 1 II I 1 1 1 II 1 1 1 1 1 1 1 | | | | 1 M 

SENFFLPGIICWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

N 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | 1 1 1 1 1 1 1 1 II 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

' 1 ' 1 ' 1 1 1 1 1 1 1 1 I 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 f 1 [ II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
> 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II | || | | | | | | | | | | | | | | | | | | | | || | | | | | | | | | 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVG FAAS P CAE I AGAAVS E I S G P FS T EDVASNCVPAQS L S E P I L 406 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I M 1 1 1 i 1 1 1 1 1 1 1 1 1 I I | | | | | 

QKRVG FAAS P CAE I AGAAVS E I S G P FS T EDVASNCVPAQS L S E P I L 4 68 




Db 


423 





RESULT 4 

US-09-548-367D-2 

; Sequence 2, Application US/09548367D 
; Patent No. 6440698 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280H 
; CURRENT APPLICATION NUMBER: US/09/548 , 367D 

CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 



PRIOR APPLICATION NUMBER; PCT/US 9 9/2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-548-367D-2 



Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 4 06; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 M M 1 1 1 M 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 I I | ( | | | | | | | m 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | || | 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 i j i i i i i i i i i i I i i \ i i i i i i i i t i i i t i i i i i i i i i i i ■ i i i i i ■ i . , . , 

1 1 11 1 1 1 1 1 M 1 1 II 1 1 I 1 1 1 I 1 1 M | | [ I 1 1 1 1 1 1 I I I I M 1 1 1 M 1 1 1 1 1 1 1 f 1 1 1 1 1 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

" 1 > 1 i 1 I ' 1 1 1 1 M M M M 1 1 1 1 1 1 1 1 1 M 1 f 1 1 II 1 M 1 1 1 1 1 1 ! 1 1 1 1 f 1 1 1 1 | 1 1 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I I M 1 1 | | | | | | | | | | | | | | || | | | | | | | | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 M 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDEN S S RS FRI T I L PQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FDRA 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 II 1 1 1 1 1 1 1 1 1 I I I I I I I | | | 1 1 1 1 1 1 M 1 1 1 1 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVG FAAS PCAE I AGAAVS EISGPFSTE DVASNCVP AQ S L S E P I L 4 06 

1 1 M 1 1 1 I II 1 I 1 1 1 I I I | | | | | | || | | | | | | | | | | | | | | | | | | | j 

QKRVGFAASPCAEIAGAAVSEI SGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





RESULT 5 

US-09-551-853D-2 

; Sequence 2, Application US/09551853D 
; Patent No. 6500667 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280L 
; CURRENT APPLICATION NUMBER: US/ 0 9/551 , 853D 
CURRENT FILING DATE: 2000-04-18 



PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 518 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-551-853D-2 



Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 4 06; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 | 1 1 1 1 i i i i i i i i i i i t i j i i i i i i i i i i * i i i i i i i i i i t i i i i i i 

1 1 1 1 1 1 I I 1 1 1 1 M M M 1 M 1 M 1 1 1 1 1 1 I I | M | || ! 1 1 I I M I ! 1 1 1 1 1 1 I I I I M | 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
N I 1 1 1 1 M 1 1 1 ! 1 1 1 1 1 I I I | | | | | | I | | | | | | | | | | | | | | M 1 1 1 1 1 M If ! 1 1 1 I 1 I 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 N 1 1 1 1 1 1 1 1 1 I I I | | | || | | | | | | | | | | | || | | | | | | | | | | | | | | | | | | | | | | | | | , | 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA.lv 

N 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 II M 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 N 1 M 1 1 1 1 1 1 1 1 1 1 1 I f I I I I | f II 1 1 1 1 1 1 1 1 1 1 I I | | | M | | 1 1 1 1 1 1 1 1 M 1 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 

' 1 1 1 M 1 1 1 1 1 1 I I I I | | | M 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
N 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I || I | | | 
Q KRVGFAAS P CAE I AGAAVS E I S G P FST EDVASNCVPAQ S L S E P I L 4 68 




Db 


423 





RESULT 6 

US-09-215-450-19 

; Sequence 19, Application US/09215450 

; Patent No. 6635748 

; GENERAL INFORMATION: 

; APPLICANT: Giese, Klaus 

; APPLICANT: Xin, Hong 

; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES 



; FILE REFERENCE: 1451.100 / 210030.447 

; CURRENT APPLICATION NUMBER: US/ 09/2 15, 4 50 

CURRENT FILING DATE: 1998-12-17 
; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 19 

LENGTH: 518 

TYPE: PRT 

ORGANISM: human 
US-09-215-450-19 



Query Match 100.0%; Score 2113; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 7e-210; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
N M 1 1 1 1 1 1 1 f 1 1 It 1 1 ! 1 1 1 1 1 1 1 I I 1 | j | | ! | I | | | | | M 1 1 1 1 1 I 1 I I | | | | | | f | 
ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HS YIDTYFDTERSSTYRSKGFD VTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FE 

>' M I I 1 1 1 1 1 1 1 1 1 f f I I f I | | M | | | | | | | f| 1 1 1 f 1 1 1 1 1 M 1 1 1 1 1 M I I I I | | j f 

±Lt ~' x j-i-i^j. ijjao o ± inorvor uvi vim lyboWlbr VGi^DLVTI PKGFNTSFLVNIATI FE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I 1 1 1 M f 1 I 1 I I I I I I 1 1 1 1 1 I M ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

SENFFLPGIPCWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

" 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 f 1 1 1 1 1 1 1 1 1 1 1 ! M 1 II 1 1 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
> 1 M 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | |[ | | | | | | M 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I I 
DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVIFDRA 

> N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 I I I I I I I | | | | | M 1 1 1 1 1 1 1 I I 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVIFDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 1 i 1 1 1 I 1 1 1 M II 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
QKRVGFAAS P CAE I AGAAVS EISGPFSTE DVASNC VPAQ S L S EP I L 4 68 




Db 


423 





RESULT 7 
US-09-717-432-2 

; Sequence 2, Application US/09717432 
; Patent No. 6291223 
; GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



ZHU, YUAN 
LI, XIAOTONG 
CHRISTIE, GARY 
POWELL, DAVID J. 
; TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASPl) 
FILE REFERENCE: GP-70663 

CURRENT APPLICATION NUMBER: US/ 09/717 , 432 
CURRENT FILING DATE: 2000-11-21 



PRIOR APPLICATION NUMBER: 60/166,974 
PRIOR FILING DATE: 1999-11-23 
NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 
; LENGTH: 514 
TYPE: PRT 

ORGANISM: MUS MUSCULUS 
US-09-717-432-2 



Query Match 92.9%; Score 1962; DB 3; Length 514; 

Best Local Similarity 91.6%; Pred. No. 3e-194; 

Matches 372; Conservative 15; Mismatches 15; Indels 4; Gaps 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
1 1 1 1 1 : 1 II 1 1 M 1 1 I 1 1 1 1 1 I i | M | | | | | | | | | | : | | | | M | | | | | | | , | , 

ALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAGAP 


60 


Db 


63 


118 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
' 1 1 1 1 1 1 1 1 : I 1 1 I I 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 I I I M M 1 1 1 : 1 1 1 1 1 1 1 II 1 1 
HS YIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATIFE 


120 


Db 


119 


178 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
" 1 1 M M 1 1 1 1 1 1 1 1 1 1 | | 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 | : : | | M | | | | | | | | | | | 

SENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVAGS 


180 


Db 


179 


238 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 | | | | | : 1 1 1 1 M f 1 1 1 1 | 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIV 


240 


Db 


239 


298 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKI S I YL 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | I | | | | | | | | | | | | : | | | | | | | | | | | | : ,, | | | | | | | 

DSGTTLLRLPQKVFDAVVEAVARTSLI PEFSDGFWTGAQLACWTNSETPWAYFPKISIYL 


300 


Db 


299 


358 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
>> M : 1 1 1 1 1 1 1 1 1 | | | | | M 1 1 1 1 1 1 1 1 1 1 1 || If | | | | | | | | | | | | | | | | | : | | | | 
RDENAS RS FRI T I L PQLYI QPMMGAGFN YEC YRFGI S S STNALVI GATVMEGFYWFDRA 


360 


Db 


359 


418 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 : 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 I I I : 1 : 1 1 1 1 
QRRVGFAVSPCAEIEGTTVSEISGPFSTEDIASNCVPAQALNEPIL 4 64 




Db 


419 





RESULT 8 
US-09-912-484-2 

; Sequence 2, Application US/09912484 

; Patent No. 6358725 

; GENERAL INFORMATION: 

; APPLICANT: Christie, Gary 

; APPLICANT: Li, Xiaotong 

; APPLICANT: Powell, David J. 

; APPLICANT: Zhu, Yuan 

; TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASPl) 

FILE REFERENCE: GP-70663-D1 
; CURRENT APPLICATION NUMBER: US/09/912,4 84 

CURRENT FILING DATE: 2001-07-25 

PRIOR APPLICATION NUMBER: 60/166,974 



PRIOR FILING DATE: 1999-11-23 

PRIOR APPLICATION NUMBER: 09/717,432 

PRIOR FILING DATE: 2000-11-21 

NUMBER OF SEQ ID NOS : 2 
; SOFTWARE: Fas t SEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 514 
; TYPE : PRT 

ORGANISM: MUS MUSCULUS 
US-09-912-484-2 



Query Match 92.9%; Score 1962; DB 4; Length 514; 

Best Local Similarity 91.6%; Pred. No. 3e-194; 

Matches 372; Conservative 15; Mismatches 15; Indels 4; Gaps 1; 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
M 1 < 1 : N I 1 1 1 1 1 1 | | | | | | f f| I I 1 1 1 1 1 1 | | M : 1 1 I I I I I | M 1 1 1 1 | | 
ALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAGAP 


60 


Db 


63 


118 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
1 1 1 1 1 1 I 1 1 : 1 1 1 I I I I I I | | | | | | i | | | | | M 1 1 1 1 1 1 I M | | | | : | | | | M 1 1 1 1 1 
HSYIDTYFDSESSSTYHSKGFDVTVKYTOGSWTGFVGFDT VTT PTfr tttjq q ttt ^™TArn T ^ 


120 


Db 


119 


178 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 1 M M 1 I 1 1 1 I 1 1 1 !! 1 1 1 1 1 II 1 M | | | | | | | | | || 1 1 :: | | | | | | | | | | 1 1 1 1 1 

SENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVAGS 


180 


Db 


179 


238 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

" 1 1 1 M 1 1 1 1 1 1 1 1 I I M 1 1 1 1 1 1 If 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 f 1 I i 1 M 1 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIV 


240 


Db 


239 


298 


Qy 


241 


DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKI SI YL 
' 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I I | 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 I I I 1 1 1 1 1 1 1 1 : I If | | | j 1 1 
DSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISIYL 


300 


Db 


299 


358 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 

M 1 1 : 1 1 1 1 1 1 1 1 1 1 1 I IN | | | | | | | | 1 1 1 1 1 M | | | | | | | | | | | | : | | | | 

RDENAS RS FRI T I LPQL YI QPMMGAGFN YEC YRFGI S S STNALVI GATVMEGFYWFDRA 


360 


Db 


359 


418 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 : 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 : 1 I I | | | | | : | : | | f | 

QRRVGFAVSPCAEIEGTTVSEISGPFSTEDIASNCVPAQALNEPIL 4 64 




Db 


419 





RESULT 9 
US-09-713-158-2 

Sequence 2, Application US/09713158 
Patent No. 6361975 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



ZHU, YUAN 
LI, XIAOTONG 
POWELL, DAVID J. 
CHRISTIE, GARY 
TITLE OF INVENTION: MOUSE ASPARTIC S ECRETAS E- 2 (MASP-2) 
FILE REFERENCE: GP-70660 

CURRENT APPLICATION NUMBER: US/ 09/7 13 , 15 8 
CURRENT FILING DATE: 2000-11-15 



; PRIOR APPLICATION NUMBER: 60/165,800 
PRIOR FILING DATE: 1999-11-16 
NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 501 
TYPE: PRT 

ORGANISM: MUS MUSCULUS 
US-09-713-158-2 



Query Match 53.6%; Score 1133; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2e-108; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 ! 1 : 1 1 1 1 1 : I | | : | | | : | | : | : | | | | | | | | | | | | | | | | | | : . , . 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 


128 


Db 


114 


: M 1 1 1 I 1 1 1 1 1 1 1 : 1 1 1 1 : I I I | : 1 1 ! 1 f I : | | : | 
QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 

> 1 1 1 1 1 1 1 1 : 1 : 1 III 1 1 1 1 1 1 I : 1 1 1 : 1 1 : I : M 1 I 1 : : II 
SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
1 : : : 1 1 1 : III 1 : 1 1 1 1 1 : I I I | : : | : : : || II 1 : 1 1 : I | | I I : 1 1 1 1 II I 
SMI IGGI DHSLYTGSLWYTPI RREWYYEVI I VRVEINGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 
1 1 1 1 = 1 1 1 : 1 1 : : : II : ! II 1 f| | : | | | | : | | | : 

NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVT 


305 


Db 


294 


353 


Qy 


306 


SRS FRITI LPQLYI QPMMGAGLNY-ECYRFGI S PSTNAL VI GAT VMEGFYVI FDRAQKRV 
: =IMMMM MM: : : M : 1 : I I : | : | | : 1 I I | | 1 : 1 1 M : 1 1 : 
NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 




Db 


414 


Mill : : | | | | | : 
GFAVS ACHVHDE FRTAAVEGP FVTADM 44 0 





RESULT 10 
US-09-548-372D-8 

; Sequence 8, Application US/09548372D 
; Patent No. 6420534 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/62801 
; CURRENT APPLICATION NUMBER: US/ 0 9/ 54 8 , 372D 

CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 



PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 501 
TYPE: PRT 
; ORGANISM: Mus mus cuius 
US-09-548-372D-8 



Query Match 53.6%; Score 1133; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2e-108; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 



Qy 


9 


PAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS YIDTYF 
1 : ' : 1 1 1 I 1 : 1 1 1 : I | ! : | I : | : | || | 1 1 | | || I 1 1 1 1 1 If : : | : 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 
: 1 ' ) 1 1 I | | | | | | | : | | | | : | | | | : | | | | | | : | | : | 

QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


128 


Db 


114 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 

' l M i 1 1 1 I : 1 : 1 1 1 1 I I I I | | | : | | | : | | : | : | | | | | : . , | 

SNWEGILGLAYAEI7VRPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
' ::: 'H: III 1 : 1 1 1 1 1 : MM:: j : : : | | | | | : || : 1 1 1 1 1 : 1 I I I I i | 

SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDENS 
1 1 1 1 : 1 1 1 : 1 1 : : : 1 1 :| Mil 1 II || 1 M : 1 1 1 1 : If 1 : 
NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SLYLMGEVT 


305 


Db 


294 


353 


Qy 


306 


S RS FRI T I LPQL YI QPMMGAGLN Y- EC YRFGI S P S TNALVI GATVMEGFYVI FDRAQKRV 
: : 1 1 1 1 ' 1 1 1 1 1 : : 1 : : : | | : | : | | : 1 : II : 1 1 1 1 II : 1 1 1 1 : 1 I : 
NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 




Db 


414 


1 1 1 1 1 : : 1 1 1 1 1 : 
G FAVS ACH VH D E FRT AAVE G P FVT ADM 440 





RESULT 11 
US-09-548-367D-8 

; Sequence 8, Application US/09548367D 
; Patent No. 6440698 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280H 



; CURRENT APPLICATION NUMBER: US/09/548, 367D 
CURRENT FILING DATE: 2000-04-12 
PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 

; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 

; SEQ ID NO 8 

LENGTH: 5 01 
TYPE: PRT 

ORGANISM: Mus musculus 
US-09-548-367D-8 



Query Match 53.6%; Score 1133; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2e-108; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Ga 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
' : ' : M M 1 : 1 1 1 : 1 1 1 : II : I : I || 1 1 1 1 I 1 1 1 1 1 1 I I | | : : | : 
PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 
: Mill 1 | | f | | | | : | | | | : || | | : I I | | | | : | j : j 
QRQLS STYRDLRKGVYVP YTQGKWEGELGTDLVS I PHGPNVTVRANIAAITES DKFFING 


128 


Db 


114 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPV AGSGTNGG 

1 1 1 1 1 ! 1 1 1 : 1 : 1 III 1 1 1 1 1 1 1 : 1 1 1 : 1 1 : 1 : | | | | | : : | | 
SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
1 : : : N 1 : Ml 1 : 1 1 1 1 1 : MM:: I : : : | 1 | | | :||:||| 1 1 M 1 II M 1 

SMI IGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YLRDENS 
1 1 1 1 : 1 1 1 : 1 1 : : : || : 1 1 1 1 1 1 II II 1 II : | | | | : || 1 : 
NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SLYLMGEVT 


305 


Db 


294 


353 


Qy 


306 


S RS FRI T I LPQL YI Q PMMGAGLN Y- EC YRFGI S P STNALVI GATVMEGFYVI FDRAQKRV 
: : 1 1 1 1 1 1 II 1 MM: : : II M M I : MM : | I I || I : 1 M 1 : 1 1 : 
NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 




Db 


414 


Mill : : Ml 1 I: 
G FAVS AC H VH D E F RT AAVE G P FVT ADM 44 0 





RESULT 12 
US-09-551-853D-8 

; Sequence 8, Application US/09551853D 
; Patent No. 6500667 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 



; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280L 
; CURRENT APPLICATION NUMBER: US/09/551, 853D 

CURRENT FILING DATE: 2 000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 8 

LENGTH: 501 
TYPE: PRT 

ORGANISM: Mus mus cuius 
US-09-551-853D-8 



Query Match 53.6%; Score 1133; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2e-108; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 : 1 : 1 M M : J | | : | | | : | | : | : | | | | I I I I I I I I I f | | | | : : | : 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGJ^APHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERS STYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI FESENFFLPG 


128 


Db 


114 


: 1 1 1 1 ! | | | | | | | | : | | M : | | | | : I 1 1 I | | : | | : | 
QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 

' MINIM : 1 : 1 M 1 M M M 1 : 1 M : M : M 1 1 I 1 I : : | | 
SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 


245 


Db 


234 


1 : : M M : III I : i 1 M I : III!:: | : : : | | | | | : | | : | | | || : II 1 I 1 1 1 
SMI IGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 


293 


Qy 


246 


LLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YLRDENS 
M 1 M 1 1 1 : 1 I : : : | | : 1 1 1 1 1 1 1 1 I I II M 1 1 II : 1 1 1 : 
NLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTT PWNI FPVI SL YLMGEVT 


305 


Db 


294 


353 


Qy 


306 


S RS FRI T I LPQL YI QPMMGAGLN Y - EC YRFGI S PS TNALVI GATVMEGFYVI FDRAQKRV 
= : 1 M II 1 1 1 1 I : : | : : : | | : | : | | : | . |, : | | II II : II 1 I : 1 1 : 
NQS FRI TILPQQYLRP VED VATS QDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 
Mill : : | | | | | : 
G FAVS ACHVH D E FRT AAVE G P FVT ADM 4 40 




Db 


414 





RESULT 13 
US-09-724-5 66A-65 



Sequence 65, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 
APPLICANT: Anderson, John P. 
APPLICANT: Basi, Guriqbal 
APPLICANT: Doane, Minn Tam 
APPLICANT: Frigon, No. 6627739mand 
APPLICANT: John, Varghese 
APPLICANT: Power, Michael 
APPLICANT: Sinha, Sukanto 
APPLICANT: Tatsuno, Gwen 
APPLICANT: Tung, Jay 
APPLICANT: Wang, Shuwen 
APPLICANT: McConlogue, Lisa 

TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 22 8-US-NEWC2 

CURRENT APPLICATION NUMBER: US/09/724 , 566A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: Fast SEQ for Windows Version 4.0 
SEQ ID NO 65 
LENGTH: 501 
TYPE: PRT 

ORGANISM: Mus musculus 
US- 09-72 4-566A-65 



Query Match 53.6%; Score 1133; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2e-108; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 : 1 : 1 1 1 1 1 = 1 1 1 : 1 1 I : M : 1 : 1 1 1 | | | | | | | | | | I | | | | : : | : 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFESENFFLPG 
: 1 1 1 1 ' 1 I I 1 1 1 1 I : I | | | : | | | | : | | | | | | : | | : | 

QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


128 


Db 


114 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPV AGSGTNGG 

1 1 1 1 1 1 1 I I : 1 : 1 II 1 1 1 1 1 1 1 1 : 1 I I : I | : f : | | | | | : : | | 

SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHI PNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
1 : : : 1 1 1 : Ml 1 : 1 1 1 1 1 : MM:: 1 : : : | | | | | : || : 1 1 1 1 I : I || 1 1 1 1 
SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDENS 
M 1 1 : 1 1 1 : 1 1 : : : | | : 1 1 1 1 1 1 1 I I I 1 1 1 : II 1 1 : I I 1 : 
NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYIMGEVT 


305 


Db 


294 


353 



Q y 306 SRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVI FDRAQKRV 364 

• : I I I I I I I I I I : : I : : : I I : I : | | : I : I ( : I I I I || : M I I : I I : 

DiD 354 ^QSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYVVFDRARKRI 413 

365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 
III I I : : t M I I : 

Db 414 GFAVSACHVHDEFRTAAVEGPFVTADM 440 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



RESULT 14 
US-09-724-566A-58 

; Sequence 58, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 
APPLICANT: Anderson, John P. 

Basi, Guriqbal 
Doane, Minn Tarn 
Frigon, No. 6627739mand 
John, Varghese 
Power, Michael 
Sinha, Sukanto 
Tatsuno, Gwen 
Tung, Jay 
Wang, Shuwen 
McConlogue, Lisa 
TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 22 8-US-NEWC2 

CURRENT APPLICATION NUMBER: US/ 0 9/724, 566A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 58 
LENGTH: 4 07 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-724-566A-58 

Query Match 53.6%; Score 1132; DB 4; Length 407; 

Best Local Similarity 54.0%; Pred. No. 1.8e-108; 

Matches 209; Conservative 65; Mismatches 109; Indels 4; Gaps 2; 

QY 9 PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 

1 : I : I M I I : I II : I I I : | | : | : I M I I I I I I I I | | | | | | | : : | s 

Db 9 PGRRGS FVEMVDNLRGKS GQGYYVEMTVGS P PQTLNI LVDTGSSNFAVGAAPHP FLHRYY 68 

Q y 69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 

: I I I I I I I M M I I : I I I I : I f | | : I i I I I I : I I : | 

Db 59 QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 128 



Qy 



129 I KWN G I L G LAYAT LAK PSSSLETFFDS LVT QAN I PN VF SMQMC GAGL P VAG S GTNGG 185 



Db 


129 


Qy 


186 


Db 


189 


Qy 


246 


Db 


249 


wy 


^ n a 


Db 


309 


Qy 


365 


Db 


369 



I MINIM : I : | M | MINI I : : I I : I I : i : I I i I : I : | 



— — ^.^^yi^^uv^uiju^rujanx vjj^iji! 

1 : : : M I : III I : I I I M : | | | | :: | : : : | | | | | : | | : | | | | | : | | | | | | | 



'IMMII:!!::: I : I I I II I I I I I I I I : I I I I : | | I 



- ~ ^^.i^vxuni vi-muj:iviruKhyi\KV 

"MINIMI I : : I : : : I I : | | | | : | : | | : | | , ) M . , , , , . , , . 



HI I I : : III I I: 



RESULT 15 
US-09-724-566A-74 

Sequence 74, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 
APPLICANT: Anderson, John P. 
APPLICANT: Basi, Guriqbal 
APPLICANT: Doane, Minn Tarn 
APPLICANT: Frigon, No. 6627739mand 
APPLICANT: John, Varghese 
APPLICANT: Power, Michael 
APPLICANT: Sinha, Sukanto 
APPLICANT: Tatsuno, Gwen 
APPLICANT: Tung, Jay 
APPLICANT: Wang, Shuwen 
APPLICANT: McConlogue, Lisa 

TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 22 8-US-NEWC2 

CURRENT APPLICATION NUMBER: US / 0 9/ 724 , 5 66A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 74 
LENGTH: 4 31 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-0 9-724-566A-74 

Query Match 53.6%; Score 1132; DB 4; Length 431; 

Best Local Similarity 54.0%; Pred. No. 2e-108; 



Db 



Matches 209; Conservative 65; Mismatches 109; Indels 4; Gaps 2; 

Qy 9 P^GAANFLAMVDNLQGDS GRGYYLEMLI GT PPQKLQILVDTGS SNFAVAGTPHS YI DT YF 68 

I : I : I I M I : I I I : | | | : | | : | : | | | | | | | | , | |, , | , | . . , . 

Db 33 PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAA.PHPFLHRYY 92 

Qy 69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 

: IN I I I I MM | | :| || |:|, , | : Ml I M: M: I 

Db 93 Q^QLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVR7\NIAAITESDKFFING 152 

Qy 129 IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS — GTNGG 185 

I M M M M : | : | Ml I I II M I : M M I M M I I I I I : I - || 

153 SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGG 212 

Qy 186 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 245 

I : : : N I : II I I : | || || : | | | | : : | : : : | | | | | : |, : II I I M M I I I I I 
Db 213 SMIIGGIDHSLYTGSLWYTPIRREWYYEVI IVRVEINGQDLKMDCKEYNYDKSIVDSGTT 272 

Qy 246 ^^RLPQKVFDAVV^EAVARASLI PEFSDGFWTGSQLACWTNSETPWS YFPKIS 305 

I I M : M I : i I : : : I I : I M M ! M M II I : I I II * M I * 

Db 273 ^LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVT 332 

QY 306 SRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRV 364 

: : I I M II II I I : : I : : : I | : | || | : | : | | M I M M * M I I * I I * 

Db 333 NQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRI 392 

Qy 365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 

III I I : : I I I I I : 

Db 393 G FAVS AC HVH D E FRTAAVE G P FVT L DM 419 



Search completed: March 4, 2004, 15:42:14 
Job time : 27.3468 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: March 4, 2004, 15:30:05 ; 



Search time 22.0277 Seconds 

(without alignments) 

1772.942 Million cell updates/sec 



Title: 

Perfect score 
Sequence : 

Scoring table 



Searched 



US-09-66 8-314C-2_COPY_63_4 6 8 
2113 

1 ALEPALAS PAGAANFLAMVD STEDVASNCVPAQSLSEPIL 406 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



283366 



Post-processing : 



Database 



Minimum Match 0% 

Maximum Match 100% 

Listing first 45 summaries 

PIR 78:* 



1 
2 
3 
4 



pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


1 


1132 


53 . 6 


501 


2 


A59090 


2 


359.5 


17 . 0 


377 


1 


PEMQCJ 


3 


355 


16. 8 


389 


2 


JE0371 


4 


351.5 


16.6 


388 


2 


JC7246 


5 


350. 5 


16. 6 


388 


2 


A29937 


6 


346. 5 


16.4 


383 


2 


JC7573 


7 


339. 5 


16. 1 


384 


2 


A39314 


8 


324.5 


15.4 


394 


2 


B43356 


9 


318 


15.0 


385 


2 


JC7575 


10 


313 


14 . 8 


392 


1 


A24608 


11 


306. 5 


14 . 5 


383 


2 


A41443 


12 


306. 5 


14 . 5 


410 


1 


KHMSD 


13 


305 


14 . 4 


407 


1 


KHRTD 



Description 



aspartic proteinas 
gastricsin (EC 3.4 
pepsin C (EC 3.4.2 
pepsinogen C - com 
gastricsin (EC 3.4 
pepsinogen C - Afr 
gastricsin (EC 3.4 
gastricsin (EC 3.4 
pepsinogen A - bul 
gastricsin (EC 3.4 
pepsin (EC 3.4.23. 
cathepsin D (EC 3. 
cathepsin D (EC 3. 



1 A 
X ft 


O U 4 


1 A 
1 4 


. 4 


4 Uz 


1 


REMSK 


1 s 

X O 


^ n 9 


1 A 
X 4 


o 
. o 


o o 4 


Z 


JC / 5 /4 


1 

X D 


■3UZ 


X fl 


o 
. o 


4 (Jo 


o 
Z 


A/DO / 9 


1 7 




X 4 


. O 


i n o 

o y o 


Z 


S 664 6b 


1 ft 
X o 


9 Q ft R 


1 A 
X 4 


. 1 


412 


1 


KHHUD 


1 Q 

X 


9 Q 7 
Z y / 


1 A 
X 4 


. 1 


o O 1 

oo / 


2 


C38302 


Z u 


9Q7 
Z y / 


1 4 


. 1 


O Q ft 

o o 8 


1 


fi T XT T 

PEHU 


9 1 
z _l 


Z O 


x 4 


. u 


o O O 

o o 8 


2 


A3 014 2 


9 9 
Z Z 


9 Q ^ 


14 


. U 


o o o 

o o 8 


2 


B30142 


9 ^ 
z o 


9 Q £ 


1 A 

x 4 


. U 


o c\ a 

o y 8 


2 


151185 


9 4 
Z " 


9 Q ^ 
Z y O 


14 


. V 


4 01 


1 


REMSS 


9 R 
Z O 


9 Q A 
Z y 4 


lo 


Q 


"3 o a 

3 8 8 


1 


S19684 


9 

Z D 


9 Q 4 
Z y H 


lO , 


. y 


4 U 0 


2 


147099 


9 7 
Z / 


z y o 


1 O . 


. y 


O f\ n 

387 


2 


D38302 


9 ft 
Z o 


9 Q 1 

z y x 


lo . 


o 
. o 


"3 O Ci 

38 8 


1 


S19682 


9 Q 

Z. -7 


9 Q 1 
Z. ^ X 


1 "3 
X O « 


p 

i O 


a r\ tz 
4 U b 


1 


REHUK 




z y u . o 


1 O , 


, / 


396 


2 


S36865 


O X 


z y u 


1 O . 


, 1 


4 02 


1 


RERTK 


^ 9 
OZ 


9 ft Q 

z o y 


1 O . 


, 1 


O O T 

387 


2 


E38302 


J O 


9 ft ft 

Zoo 


1 O . 


b 


"i o *n 

387 


2 


B38302 


A 
O 4 


9 ft Q 
Z O O 


lo . 


O 


o o n 

38 8 


1 


PEMQAJ 




9 ft 7 R 
Z O / • O 


X o . 


D 


ooy 


2 


S66516 


O D 


9 ft 7 R 
Z O / . O 


lo. 


D 


/~ O 't 

632 


2 


T45858 


O / 


Z o / 


lo . 


AT 

o 


391 


2 


A4 335 6 


^ ft 
O o 


TOT 

Z O / 


1 O . 


D 


396 


2 


A3 4 4 01 


O y 


9 ft £ 
Z O D 


lo . 


O 


O O '1 

382 


1 


PECH 


4 U 


o o s~ 


13 . 


5 


388 


1 


PEMQAR 


41 


285. 5 


13. 


5 


387 


2 


JC7245 


42 


285 


13. 


5 


506 


2 


T07915 


43 


284.5 


13. 


5 


386 


1 


PEPG 


44 


284 . 5 


13. 


5 


508 


2 


D85056 


45 


282.5 


13. 


4 


381 


1 


CMSHB 



renin (EC 3.4.23.1 
pepsinogen A - Afr 
saccharopepsin (EC 
cathepsin E (EC 3. 
cathepsin D (EC 3. 
pepsin (EC 3.4.23. 
pepsin A (EC 3.4.2 
pepsin A (EC 3.4.2 
pepsin A (EC 3.4.2 
cathepsin D (EC 3. 
renin (EC 3.4.23.1 
pepsin A (EC 3.4.2 
renin (EC 3.4.23.1 
pepsin (EC 3.4.23. 
pepsin A (EC 3.4.2 
renin (EC 3.4.23.1 
cathepsin E (EC 3. 
renin (EC 3.4.23.1 
pepsin (EC 3.4.23. 
pepsin (EC 3.4.23. 
pepsin A (EC 3.4.2 
oryzasin (EC 3.4.2 
hypothetical prote 
cathepsin E (EC 3. 
cathepsin E (EC 3. 
pepsin A (EC 3.4.2 
pepsin A (EC 3.4.2 
pepsinogen A - com 
probable aspartic 
pepsin A (EC 3.4.2 
probable aspartic 
chymosin (EC 3.4.2 



ALIGNMENTS 



RESULT 1 
A59090 

aspartic proteinase (EC 3.4.23.-) BACE precursor - human 
N;Alternate names: beta-secretase ; beta-site APP cleaving enzyme 
C; Species: Homo sapiens (man) 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change ll-May-2000 
C;Accession: A59090 

R;Vassar, R. ; Bennett, B.D.; Babu-Khan, S.; Kahn, S . ; Mendiaz, E.A.; Denis, P.; 
Teplow, D.B.; Ross, S.; TAmarante, P.; Loeloff, R. ; Luo, Y. ; Fisher, S.; Fuller, 
J.; Edenson, S.; Lile, J.; Jarosinski, M.A. ; Biere, A.L.; Curran, E.; Burgess, 
T.; Louis, J.C.; Collins, F. ; Treanor, J.; Rogers, G. ; Citron, M. 
Science 286, 735-741, 1999 

A; Title: beta-Secretase cleavage of Alzheimer's amyloid precursor protein by the 
transmembrane aspartic protease BACE. 

A; Reference number: A59090; MUID : 2 0002 972 ; PMID : 10531 052 
A;Note: submitted to GenBank, September 1999 
A; Accession: A59 090 

A; Status: not compared with conceptual translation 
A;Molecule type: mRNA 
A; Residues: 1-501 <VAS> 



A; Cross-references : GB:AF190725; NID : g611853 8 ; PIDN: AAF04142 . 1 ; PID:g6118539 
C; Genetics : 

A; Gene : BACE 

C; Superf amily : beta-secretase 

C; Keywords: Alzheimer's disease; aspartic proteinase; brain; glycoproteins- 
hydrolase; protein digestion; transmembrane protein; zymogen 
F; 1-21/Domain: signal sequence #status predicted <SIG> 
F;22-45/Domain: propeptide #status predicted <PRO> 
F;46-501/Product: acid proteinase BACE #status predicted <MAT> 
F; 461-477/Domain: transmembrane #status predicted <TRN> 
F; 93, 289/Active site: Asp #status predicted 

F;153,172,223,354/Binding site: carbohydrate (Asn) (covalent) Status predicted 
F; 330-38 0/ Disulfide bonds: #status predicted 



Query Match 53.6%; Score 1132; DB 2; Length 501; 

Best Local Similarity 54.0%; Pred. No. 5.1e-82; 

Matches 209; Conservative 65; Mismatches 109; Indels 4; Gaps 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 : 1 : 1 f 1 II : 1 1 1 : 1 1 1 : 1 1 : I : | | | f | | | | | | | | | | | | | | . - , . 

PGRRGS FVEMVDNLRGKS GQG YYVEMTVGS P PQTLNI LVDTGS SNFAVGAAPH P FLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI FESENFFLPG 
: " 1 1 1 1 1 1 1 1 1 1 1 : I | | | : M 1 1 : 1 1 1 1 1 1 : I | : | 
QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


128 


Db 


114 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS GTNGG 

> 1 1 1 M 1 1 1 : 1 : 1 III 1 1 II 1 1 1 : : M : | I : | : | | | | | : | : | , 

SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
' : : : " 1 : 1 I 1 1 : 1 1 1 I I : I | | | : : | : : : j | | | | : | | : 1 1 1 1 1 : 1 1 1 1 1 II 

SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVE1NGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI YLRDENS 
1 1 M : 1 1 1 : 1 I : : : II :| IMI 1 II II f 1 1 : 1 1 I I : I I f : 
NLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FPVI S L YLMGEVT 


305 


Db 


294 


353 


Qy 


306 


SRS FRI T I L PQL YIQPMMGAGLN Y- EC YRFGI S P STNALVI GAT VMEGFYVI FDRAQKRV 
s : 1 M 1 1 1 1 1 1 1 : : 1 = : : II : 1 II 1 : 1 : If : 1 M II 1 : 1 1 1 1 : II : 
NQS FRIT I LPQQYLRPVEDVATSQDDCYKFAI SQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEI SGPFSTEDV 391 

1 1 1 1 1 : : III 1 1: 
GFAVSACHVHDEFRTAAVEGPFVTLDM 44 0 




Db 


414 





RESULT 2 
PEMQCJ 

gastncsin (EC 3.4.23.3) precursor - Japanese macaque (fragment) 
N; Alternate names: pepsin C 

C; Species: Macaca fuscata (Japanese macaque) 

C;Date: 13-Aug-1986 #sequence_revision 19-Oct-1995 #text change 18-Jun-1999 
C;Accession: S19683; A00986; A22402; S16066 
R;Kageyama, T.; Tanabe, K. ; Koiwai, O. 
Eur. J. Biochem. 202, 205-215, 1991 



A; Title: Development-dependent expression of isozymogens of monkey pepsinogens 
and structural differences between them. 

A; Reference number: S19681; MUID: 92037645; PMID: 1935977 
A;Accession: S19683 
A;Molecule type: mRNA 
A; Residues: 1-377 <KAG> 

A;Cross-references: EMBL:X59754; NID:g38072; PIDN : CAA42426 . 1 ; PID:g38073 

R;Kageyama, T. ; Takahashi, K. 

J. Biol. Chem. 261, 4406-4419, 1986 

A; Title: The complete amino acid sequence of monkey progastricsin . 
A; Reference number: A00986; MUID : 86168133; PMID : 3514 597 
A;Accession: A00986 
A;Molecule type: protein 

A; Residues : 6-330, ' V , 332-349, ' VY ' , 350-377 <KA2> 
R;Kageyama, T.; Takahashi, K. 
J. Biochem. 97, 1235-1246, 1985 

A;Title: Monkey pepsinogens and pepsins. VII. Analysis of the activation process 

and determination of the NH2-terminal 60-residue sequence of Japanese monkey 

progastricsin, and molecular evolution of pepsinogens. 

A; Reference number: A22402; MUID : 8528 9106 ; PMID: 3928607 

A;Accession: A22402 

A; Molecule type: protein 

A; Residues: 6-65 <KA3> 

C; Comment: This enzyme has more restricted specificity than pepsin A 
C;Comment: The enzyme is activated in a two-step process that gives rise to two 
end products. The shorter, Ser-gas tricsin, is the major product. 
C; Superf amily : pepsin 

C; Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion; 
stomach 

F;l-5/Domain: signal sequence (fragment) #status predicted <SIG> 
F; 6-37 7 /Product : progastricsin #status experimental <ZYM> 
F; 6- 4 5/ Domain : activation peptide #status experimental <APT> 
F;46-377/Product: Gly-gas tricsin #status experimental <MIN> 
F;49-377/Product : Ser-gastricsin #status experimental <MAT> 
F;31-32/Cleavage site: Phe-Leu (pepsin) #status experimental 
F;45-46/Cleavage site: Phe-Gly (pepsin) #status experimental 
F;48-49/Cleavage site: Leu-Ser (pepsin) #status experimental 
F; 80, 265/Active site: Asp #status predicted 

F;93-98, 256-260, 299-332/Disulf ide bonds: #status experimental 

Query Match 17.0%; Score 359.5; DB 1; Length 377; 

Best Local Similarity 30.1%; Pred. No. 7.5e-21; 

Matches 112; Conservative 60; Mismatches 111; Indels 89; Gaps 17; 

QY 30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHS YIDTYFDTERSSTYRSKGF 81 

1 : 1 : I I I I I I : | | | | | | | | | M I : I 

Db 62 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHS RFNPSESSTYSTNGQ 117 

Q y 82 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

- : I I I I I I I I : I : | | | | | | | . . . , , . , 

Db 118 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 170 

Qy 137 LAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPS 195 

Mill: : ; | : | : : : : | | : : , . . , , . . , , , . . , 

Db 171 LAYPTLSVDGAT— TAMQGMVQEGALTSPIFSVYLSDQ QGSSGGAWFGGVDSS 222 

QV 196 LYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 255 



Db 


223 


Qy 


256 


Db 


280 


Qy 


o r\ 


Db 


313 


Qy 


358 


Db 


365 



1 1 I I : : I : : I I : I I I : MM: || M M M M M I M M 



MM I : | I : : : 
TGAQ EDEYGQFLVNCNS IQNLPTLTFI I 312 



III M I Ml I : : | : : | : : 

PSSYI LNNNGYCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVY 3 64 



M M I : 



RESULT 3 
JE0371 

pepsin C (EC 3.4.23.-) precursor - chicken 
N;Alternate names: pepsinogen C 
C; Species: Gallus gallus (chicken) 

C;Date: 23-Jul-1999 #sequence_revision 23-Jul-1999 #text change ll-May-2000 

C;Accession: JE0371 ~ 

R; Sakamoto, N.; Saiga, H.; Yasugi, S. 

Biochem. Biophys . Res. Commun. 250, 420-424, 1998 

A;Title: Analysis of temporal expression pattern and cis-regulatory sequences of 
chicken pepsinogen A and C. 

A; Reference number: JE0370; MUID : 9844 08 13 ; PMID: 9753645 

A;Accession: JE0371 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-3 8 9 <SAK> 

C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; hydrolase 

Query Match 16.8%; Score 355; DB 2; Length 389; 

Best Local Similarity 28.7%; Pred. No. 1.8e-20; 

Matches 114; Conservative 58; Mismatches 121; Indels 104; Gaps 16; 
Qy 13 ANFLAMVDNLQGDS GRGYYLEMLI GTP PQKLQI LVDTGS SNFAVAGT PHSYI 64 

: I I : I : M M M M M M M M I I I I | : 

Db 56 SNFATAYEPLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQSQACANHN — 113 

QY 65 DTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFN TS 110 

II MM:: : : : | | | | | | | M I : I : M 

Db 114 —EFDPNESSTFSTQDEFFSLQYGSGSLTGIFGFDTVTI-QGISITNQEFGLSETEPGTS 170 

QV 111 ELVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPN — VFSM 168 

I I : : : M M M : : : M I : | \: : M I 

Db 171 FLYS PFDGILGLAFPSI SAGGATTVMQKMLQENLLDFPVFS F 212 

QV 169 QMCGAGLPVAGS GTNGGS LVLGGI EP S L YKGD I WYT P I KEEWY YQI E I LKLEI GGQS LNL 228 

: I I : I I i I M : : M M I I : I I : : I : M I : I I I I 

Db 213 YLSGQ EGSQGGELVFGGVDPNLYTGQ1TWTPVTQTTYWQIGIEDFAVGGQSSGW 2 66 

QY 22 9 DCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSET 288 

' : : M M I I : I I : | : M : : : : : | : | : | ;| | 



Db 



2 67 -CSQ--GCQGIVDTGTSLLTVPNQVPTELMQYIG AQADD SGQYVASCSNTE- 314 



Qy 289 PWSYFPKI SI YLRDENS SRSFRITILPQLYIQPMMGAGLNYECY 332 

1,1 I I : | | : IN ||: 
Db 315 — YMPTITFVISGTSFPLPPSAYMLQSNSDYCTVGIESTYLPSQTGQPLW 362 

QY 333 RFGI S PSTNALVI GATVMEGFYVT FDRAQKRVGFAAS 369 

: : I : : I I : I : t I I I : 

Db 363 ILGDVFLRVYYS I YDMGNNQVGFATA 38 8 



RESULT 4 
JC7246 

pepsinogen C - common marmoset 

C; Species: Callithrix jacchus (common marmoset) 

C;Date: 09-Jun-2000 #sequence__revision 09-Jun-2000 #text change 21-Jul~2000 
C;Accession: JC7246 ~~ 
R;Kageyama, T. 

J. Biochem. 127, 761-770, 2000 

A; Title: New world monkey pepsinogens A and C, and prochymosins . Purification 
characterization of enzymatic properties, cDNA cloning, and molecular evolution 
A; Reference number: JC7245 
A; Accession: JC724 6 
A; Molecule type: mRNA 
A; Residues: 1-388 <KAG> 
A;Cross-references : DDBJ: AB038385 
A; Experimental source: strain NW7 91 

C; Comment: This protein, a zymogen of pepsins, is the major proteolytic enzyme 
in vertebrate gastric juices. It plays roles in gastric digestion, and is a 
useful molecular marker for clarifying the evolution of mammalian orders and 
families . 

C; Superf amily : pepsin 

C; Keywords: gastric juice; zymogen 

Query Match 16.6%; Score 351.5; DB 2; Length 388; 

Best Local Similarity 30.1%; Pred. No. 3.4e-20; 

Matches 112; Conservative 56; Mismatches 115; Indels 89; Gaps 17; 

QY 30 YYLEMLI GTPPQKLQILVDTGS SNFAV AGTPHSYIDTYFDTERSSTYRSKGF 81 

1 : 1 : > I I I I I : | | | | M I I I I I I | : | || | | | 

Db 73 Y FGE I S I GT P PQN FL VL FDT G S SNLWVP S VYCQ S QACT S H S RFNPSASSTYSSNGQ 128 

Q y 82 DVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI FESENFFLPG IKWNGILG 136 

: : : I I I I I I I I : I : | | | | | | | • • ■ I I • | 

Db 129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 181 

Qy 137 IAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVXGGIEPS 195 

III I = : : I : : : : : | | I : I : : M : : : I I : : I 

Db 182 LAYPALSMGGAT— TAMQGMLQEGALTSPVFSFYLSNQ QG S S GGAVI FGGVDSS 233 

QY 196 LYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 255 

I I I I = : I : : I I : I I I : Mil: I I : I I I I : I I : I I : | | : 

Db 234 LYTGQI YWAPVTQELYWQIGIEEFLIGGQASGW-CSE--GCQAIVDTGTSLLTVPQQYMS 290 

QY 256 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKI SI YLRDENS 305 

1 : I I I I I 1:1 | : : : 
Db 291 AFLEA TGAQ EDEYGQFLVNCDSIQNLPTLTFII 323 



Qy 306 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP STNALVT GATVMEGFYVI F 357 

: 1 " I : I 1:1 I : : I : : | : | 

Db 324 -NGVEFPLPPSSYI LSNNGYCTVGVEPT YLS SQNSQPLWI LGDVFLRS YYS VF 375 

Qy 358 DRAQKRVGFAAS 369 

f I I I I I : 

Db 376 DLGNNRVGFATA 387 



RESULT 5 
A29937 

gastricsin (EC 3.4.23.3) precursor - human 
N;Alternate names: pepsin C; pepsinogen C 
C; Species: Homo sapiens (man) 

C;Date: 17-Oct-1988 #sequence_revision 17-Oct-1988 #text change 31-Mar-2000 
C;Accession: A29937; A31811; PX0028; 154213; A91125; A23458 
R;Hayano, T.; Sogawa, K. ; Ichihara, Y. ; Fu j ii-Kuriyama , Y. ; Takahashi K 
J. Biol. Chem. 263, 1382-1385, 1988 

A;Title: Primary structure of human pepsinogen C gene. 
A;Reference number: A29937; MUID : 8 808727 6; PMID:3335549 
A;Accession: A29937 
A;Molecule type: DNA 
A; Residues: 1-3 8 8 <HAY> 

R;Taggart, R.T.; Cass, L.G.; Mohandas, T.K.; Derby, P.; Barr, P.J.; Pals G • 
Bell, G.I. ' 

J. Biol. Chem. 264, 375-379, 1989 

A;Title: Human pepsinogen C (progastricsin) . Isolation of cDNA clones, 
localization to chromosome 6, and sequence homology with pepsinogen A. 
A; Reference number: A31811; MUID : 8907967 9 ; PMID:2909526 
A;Accession: A31811 
A;Molecule type: mRNA 
A; Residues: 1-38 8 <TAG> 

A;Cross-references: GB:J04443; NID:g551175; PIDN : AAA60074 . 1 ; PID:g551176 
R;Athauda, S.B.P.; Tanji, M. ; Kageyama, T.; Takahashi, K. 
J. Biochem. 106, 920-927, 1989 

A; Title: A comparative study on the NH2-terminal amino acid sequences and some 

other properties of six isozymic forms of human pepsinogens and pepsins. 

A; Reference number: PX0023; MUID : 901304 02 ; PMID:2515193 

A;Accession: PX0028 

A;Molecule type: protein 

A; Residues: 17-101 <ATH> 

R;Pals, G . ; Azuma, T.; Mohandas, T.K.; Bell, G.I.; Bacon, J.; Samloff, I.M ; 
Walz, D.A.; Barr, P. J.; Taggart, R.T. 
Genomics 4, 137-148, 1989 

A;Title: Human pepsinogen C (progastricsin) polymorphism: evidence for a sinqle 
locus located at 6p21.1-pter. 

A; Reference number: 154213; MUID: 89290840; PMID:2567697 
A;Accession: 154213 

A; Status: translated from GB/EMBL/DDB J 
A;Molecule type: DNA 
A; Residues: 1-388 <RES> 

A;Cross-references: GB:M23077; NID:gl89830; PIDN : AAA60063 . 1 ; PID < a3 87015 • 
GB:J03063 " y 

A;Note: parts of this sequence, including the amino end and carboxyl ends of the 
mature protein, were determined by protein sequencing 
R;Foltmann, B.; Jensen, A.L. 



Eur. J. Biochem. 128, 63-70, 1982 

A;Title: Human progastricsin. Analysis of intermediates during activation into 
gastricsin and determination of the amino acid sequence of the propart. 
A; Reference number: A91125; MUID : 83079318 ; PMID: 6816595 
A; Accession: A9112 5 
A;Molecule type: protein 

A;Residues: 17-39, » ED ' , 42-51 S 53-64 <FOL> 
A;Note: pro-form; 29-Leu was also found 

A;Note: activation at pH 2 is proposed to involve conformation change, cleavage 

after Phe-42, and cleavage after Leu-59 

C; Genetics : 

A; Gene: GDB : PGC 

A;Cross-references : GDB: 119485; OMIM: 169740 
A; Map position: 6p2 1 . 3-6p2 1 . 1 

A;Introns: 20/2; 70/3; 110/1; 149/3; 216/2; 256/2; 305/3; 338/3 
C; Superf amily : pepsin 

C; Keywords: aspartic proteinase; hydrolase; protein digestion; stomach; zymogen 
F; 1-16/Domain : signal sequence #status predicted <SIG> 
F; 17-59/Domain: propeptide #status experimental <PRO> 
F; 60-3 8 8/ Product : gastricsin #status experimental <MAT> 

Query Match 16.6%; Score 350.5; DB 2; Length 388; 

Best Local Similarity 30.1%; Pred. No. 4e-20; 

Matches 112; Conservative 59; Mismatches 112; Indels 89; Gaps 18; 

Qy 30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGF 81 

I : I : M I I I I : I I | M | | I | | | | | - I | | | : | 

Db 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHS RFNPSESSTYSTNGQ 128 

Q y 82 DVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FESENFFLPG IKWNGILG 136 

: : : I I I I I I I I : I : | | | | | | | : : : | | : | 

Db 129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 181 

Qy 137 LAYATLAKP S S S LET FFDS LVTQANI PN- VFSMQMCGAGLP VAGS GTNGGS LVLGGI EPS 195 

1 1 ' 1 : : : I : I : : : I I I : : | : : || : : | | | : : | 

Db 182 LAY PAL SVD EAT TAMQGMVQEG7VLTSPVFSVYLSNQ QGS S GGAWFGGVDS S 233 

Q y 196 LYKGD ™YTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 255 

M I I : : I : : I I : M I : MM: II :||||:||:|| 

Db 234 LYTGQIYWAPVTQELYWQIGIEEFLIGGQASGW-CSE--GCQAIVDTGTSLLTVPQQYMS 290 

Qy 256 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKISIYLRDENS 305 

1 : : : I I M I I : | I : : : 
Db 291 ALLQA TGAQ EDEYGQFLVNCNSIQNLPSLTFI I 323 

®y 306 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP STNA LVIGATVMEGFYVI F 357 

: I II I : I | : | || : : | : : | : : 

Db 324 -NGVEFPLPPSSYI LSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVY 375 

Qy 35 8 DRAQKRVGFAAS 369 

I Mill: 
Db 37 6 D L GNN RVG FATA 387 



RESULT 6 
JC7573 

pepsinogen C - African clawed frog 



N; Alternate names: progas tricsin 

C; Species: Xenopus laevis (African clawed frog) 

C;Date: 30-Jun-2001 #sequence_revision 30-Jun-2001 #text change 03-Aug-2001 
C;Accession: JC7573; PC7118 ~ 
R;Ikuzawa, M. ; Inokuchi, T.; Kobayashi, K. ; Yasumasu, S. 
J. Biochem. 129, 147-153, 2001 

A; Title: Amphibian pepsinogens: Purification and characterization of Xenopus 

pepsinogens, and molecular cloning of Xenopus and bullfrog pepsinogens. 

A; Reference number: JC7573; MUID : 21064922 ; PMID : 1 1134 969 

A; Contents : Stomach 

A; Access ion: JC7573 

A; Molecule type: mRNA 

A; Residues: 1-383 <IKU> 

A; Cross-references : DDB J : AB04537 9 

A; Access ion : PC7118 

A;Molecule type: protein 

A; Residues: 17-68 <IK2> 

C; Comment: This protein is a zymogen for gastric aspartic proteinase, with 

pepsin-like activity. 

C ; Genetics : 

A; Gene: PgC 

C; Super family : pepsin 

C; Keywords: stomach; zymogen 

Query Match 16.4%; Score 346.5; DB 2; Length 383; 

Best Local Similarity 29.9%; Pred. No. 8.3e-20; 

Matches 118; Conservative 60; Mismatches 124; Indels 93; Gaps 23; 

QY 1 ALEPAIASPAGAANFIAMVTJNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

' 1 1 : 1 : : 1 111:111111 : I I I II I I | | | 

Ur> 55 AYEP LSNYMDM S YYGEI S I GTPPQNFLVLFDTGSSNLWVAST- 96 



Db 



Qy 61 HSYIDT YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLV 113 

1 : I : =11111 : : : I II II : f I IN 
97 —YCQSQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGILGYDTVTIQ 145 

Q y 114 NIATIFESENFFL PG IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPN 164 

1 : I : 'l II : : : I I I I I I I : : | : : | : : | | : | 

Db 146 NVA— ISQQEFGLSETEPGTNFVYAQFDGILGLAYPSIAVGGAT— TVMQGMM-QQNLLN 200 

Q y 165 — VFSM Q M CGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIG 222 

: ' : I I I I I s I I : : : I I I : : I I : | | : | | | | 

Db 201 QPIFGFYLSGQ S SQNGGEVAFGG VDQN YYTGQI YWT PVT S ET YWQ I GI QGFS IN 254 

QY 223 GQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLAC 2 82 

11= I : : I I I I : I ! : I I II It :::::: s | - 

255 GQATGW-CSQ--GCQAIVDTGTSLLTAPQSVFSSLIQSIG AQQDQNGQYWS 303 

Qy 283 WTNSETPWSYFPKISI YLRDENSSRSFRITILPQLYI-QPMMGAGLNYECYRFGIS 337 

: I : III: III: I I : I I | | | 

304 CSNIQN LPTISFTI SGVSFPLP — PSAYVLQQSSG YC-TIGIMPTYL 347 



Db 



Db 



Qy 338 PSTNA LVIGATVMEGFYVI FDRAQKRVGFAAS 369 

I 1 I : : I : : I : : I : I I I I : 

Db 34 8 PSQNGQPLWILGDVFLREYYSVYDLGNNQVGFATA 3 82 



RESULT 7 
A39314 

gastricsin (EC 3.4.23.3) precursor - bullfrog 
C; Species: Rana catesbeiana (bullfrog) 

C;Date: 19-Jun-1992 #sequence_revision 19-Jun-1992 #text change 22-Jun-1999 
C;Accession: A39314 — 

R;Yakabe, E.; Tanji, M. ; Ichinose, M. ; Goto, S.; Miki, K. ; Kurokawa, K ; Ito 
H.; Kageyama, T.; Takahashi, K. ' 
J. Biol. Chem. 266, 22436-22443, 1991 

A;Title: Purification, characterization, and amino acid sequences of pepsinogens 

and pepsins from the esophageal mucosa of bullfrog (Rana catesbeiana) 

A; Reference number: A39314; MUID : 92 042 18 6; PMID: 1939266 

A;Accession: A39314 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-384 <YAK> 

A; Cross-references: GB:M73750; NID:g213687; PIDN : AAA4 9530 . 1 ; PID:g213688 
C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; hydrolase; protein digestion 

Query Match 16.1%; Score 339.5; DB 2; Length 384; 

Best Local Similarity 27.5%; Pred. No. 3e-19; 

Matches 108; Conservative 63; Mismatches 125; Indels 97; Gaps 17; 

Q y 14 NFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT 66 

1 ' : ' MI:!IIMI : i I | | | | | | |:|: 
Db 51 NFATAFEPLANYMDMSYYGEI SIGTPPQNFLVLFDTGSSNLWV PSTYCQSQACTNHP 107 

67 YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATTFESENFFL 126 
' : : N : I I : : : I II I I : I | | | | | | : , , 

Db 108 QFNPSQSSSYSSNQQQFSLQYGTGSLTGILGYDTVQIQ NIA — ISQQEFGL 156 

Qy 127 PG IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPN — VFSMQMCGAGL 175 

N : : : I I I I I I I : : | : : : | : : | | : | : | : : | 
157 SVTEPGTNFVYAQFDGILGLAYPSIAEGGAT — TVMQGMI-QQNLINQPLFAFYLSG 210 

Q ¥ 176 PVAGSGTNG GSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNA 235 

: I I I : M : : : I I I : : | | : I I : I | | : | | : | : 

211 — QQNSQNGGEVAFGGVDQNYYSGQIYWTPVTSETYWQIGIQGFSVNGQATGW-CSQ G 265 

Q y 236 DK AIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQ LACWTNSE 287 

= I I I : I I : I I MM:::::: | : | . . ( 

Db 266 CQGI VDTGT SLLTAPQSVFS S LMQS I GAQQDQNGQYAVSCSNIQS 310 

Qy 288 TPWSYFP KISIYLRDENS — SRSFRITI LPQLYIQPMMGAGLNYECYRFGI 336 

I 1 I M Ml : (II ||: 
Db 311 LPTI SFTISGVSFPLPPSAYVLQQNSGYCTIGIMPTYLPSQNGQPLW 357 

QY 337 SPSTNALVIGATVMEGFYVIFDRAQKRVGF7VAS 369 

: : I : : I : : I : || II I : 

Db 358 ILGDVFLRQYYSVYDLGNNQVGFAAA 383 



Db 



Db 



RESULT 8 
B43356 

gastricsin (EC 3.4.23.3) precursor - guinea pig 
N;Alternate names: pepsin C 



C; Species: Cavia porcellus (guinea pig) 

C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text change 22-Jun-1999 
C;Accession: B43356 ~~ 

R;Kageyama, T. ; Ichinose, M. ; Tsukada, S.; Miki, K. ; Kurokawa, K. ; Koiwai, O.; 
Tanji, M.; Yakabe, E . ; Athauda, S.B.; Takahashi, K. 
J. Biol. Chem. 267, 16450-16459, 1992 

A;Title: Gastric procathepsin E and progastricsin from guinea pig. Purification, 
molecular cloning of cDNAs, and characterization of enzymatic properties, with 
special reference to procathepsin E. 

A; Reference number: A43356; MUID : 92355614 ; PMID: 1644829 
A;Accession: B43356 
A; Status: preliminary 
A;Molecule type: mRNA 
A; Residues : 1-394 <KAG> 

A/Cross-references: GB:M88652; NID:gl91296; PIDN : AAA37053 . 1 ; PID:gl91297 
A;Note: sequence extracted from NCBI backbone (NCBIN: 110805, NCBIP : 1 1 08 06 ) 
C; Super family : pepsin 

C; Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion- 
stomach 

Query Match 15.4%; Score 324.5; DB 2; Length 394; 

Best Local Similarity 29.0%; Pred. No. 4.8e-18; 

Matches 107; Conservative 63; Mismatches 116; Indels 83; Gaps 18; 

Qy 3 0 YYLEMLIGTPPQKLQILVDTGSSNF AVAGTPHS YIDTYFDTERSSTYRSKGF 81 

I : : : : I I I I ( I : I I I I I It : : t I I I I : | | | | : 

Db 79 YFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTH TRFNPRDSSTYVATDQ 134 

^ 82 DVTVKYTQGSWTGFVGEDLVTI PK-GFNTSFLVNIATIFESENFFLPG IK 130 

: : : f t I I I I I : f I till | : | || : 

Db 135 SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQEFGLS ETE PGSDFVYAE 181 

QY 131 WNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGS— GTNGGSL 187 

= •"111111 I : : : : | | : : : ::((:: | | | : : | | 

Db 182 FDGILGLGYPGLSEGGAT--TAMQGLLREGALSQSLFSVYL GSQQGSDEGQL 231 

Qy 188 VLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLL 247 

: I I I : : I I I I I I : : I I : : | | : | | | | | s | : II f : I I : I I 

Db 232 ILGGVDESLYTGDI YWT PVTQELYWQIGIEGFLIDGSASGWCSR GCQGIVDTGTSLL 288 

QY 248 RLPQKVFDAVVTLAVARAS LI PEFSDGFWTGSQLACWTNS ETPWS YFPKI S I YLRDENS SR 307 

: I : I : i : I : : I : : | : : | | : 
Db 289 TVPSDYLSTLVQAIGAEE — NEYGEYF VSCSSIQDLPTLTFVISGV 332 

Q y 308 SFRITILPQLYIQP MMGAGLNYEC YRFGI S P STN — ALVI GATVMEGFYVI FDRA 360 

: I H I : I I : I I : : t : : | : : | | 

Db 333 EFPLSPSAYILSGENYCMVGLESTY VSPGGGEPVWI LGDVFLRSYYSVYDLA 384 

QY 3 61 QKRVGFAAS 3 69 

I I I I I : 

Db 385 NNRVGFATA 393 



RESULT 9 
JC7575 

pepsinogen A - bullfrog 

C;Species: Rana catesbeiana (bullfrog) 



C;Date: 30- Jun-2 001 #sequence_revision 30-Jun-2001 #text change 03-Auq-2001 
C;Accession: JC7575 ~ 

R;Ikuzawa, M. ; Inokuchi, T. ; Kobayashi, K. ; Yasumasu, S. 
J. Biochem. 129, 147-153, 2001 

A; Title: Amphibian pepsinogens: Purification and characterization of Xenopus 

pepsinogens, and molecular cloning of Xenopus and bullfrog pepsinogens 

A; Reference number: JC7573; MUID: 21064922 ; PMID : 11134969 

A; Contents : Stomach 

A; Accession : JC7 57 5 

A;Molecule type: mRNA 

A; Residues: 1-385 <IKU> 

A; Cross-references : DDBJ: AB04537 6 

C; Comment: This protein is a zymogen for gastric aspartic proteinase, with 

pepsin-like activity. 

C; Genetics : 

A; Gene : PgA 

C; Superf amily : pepsin 

C; Keywords: stomach; zymogen 

Query Match 15.0%; Score 318; DB 2; Length 385; 

Best Local Similarity 28.2%; Pred. No. 1.5e-17; 

Matches 108; Conservative 65; Mismatches 136; Indels 74; Gaps 15; 

Q y 4 PALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDT GS SNFAVAGTPHS Y 63 

' : 1 1 ! 1 : M I : : I I I I t I :: I I I I II I I | 
UD bb PSLAQASG EPLQNYMDI EYFGTI S I GTPPQS FTVI FDTGS SNLWV PSVY 103 

Q y 64 IDT YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIA 116 

: I : : : | | | : : : I : : : | I I : | | : | | | : | | 

Db 104 cssp ACTNHHMFNPQQSSTFQATNTPVSIQYGTGSMSGFLGYDTVQVG NIQITNQIF 160 

Qy 117 ^1 FESE— NFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIP— NVFSMQMCGAG 174 

: : I I 'I : : M I I I I : : I I | | | | : : | | | : . , | . . 

161 GLSQSEPGSFLYYSPFDGILGLAFPSLA — SSQATPVFDNMWNQGLIPQDLFSVYL 214 

QY 175 LPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 234 

: I : I : : M : : i I I : : : | : | |:|| : : I I | | : 
Db 215 S S QGQ S GS FVL FGGVDT S Y YT GNLNWVP LTAET YWQ I T VD S I S I GGQVI AC S G 267 

Qy 235 ADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 294 

: 1111:11:11 I I : : I : : | : | : : | | | j 

268 SCSAIVDTGTSLLAGPSTPI-ANIQYYIGAN QDSNGQYV— INCNNISNMPTWFT 320 



Db 



Db 



QY 295 KI S I YLRDENSS RS FRI TI LPQLYI QPMMGAGLNYECYRFGI S PSTNALV 344 

I I : I I I : M : I : 

Db 321 INGVQYPLPASAYVRQSQQSCTSGFQAMNLP TSSGDLWI 359 

QY 34 5 I GATVMEGFYVI FDRAQKRVGFA 367 

: I : : i I : I I I I I I 
Db 360 LGDVFI RE YYWFDRANN YVAMA 382 



RESULT 10 
A24608 

gastricsin (EC 3.4.23.3) precursor - rat 

N;Alternate names: pepsinogen C 

N; Contains: pepsin A (EC 3.4.23.1) precursor 



C; Species: Rattus norvegicus (Norway rat) 

C;Date: 30-Jun-1988 #sequence_revision 05-Aug-1994 #text change 18-Jun-1999 
C;Accession: A33510; A24608; C22434; A05145; A61298 

R;Ishihara, T . ; Ichihara, Y. ; Hayano, T.; Katsura, I.; Sogawa, K. ; Fuiii- 

Kuriyama, Y. ; Takahashi, K. 

J. Biol, Chem. 264, 10193-10199, 1989 

A; Title: Primary structure and transcriptional regulation of rat pepsinogen C 
gene . ^ y 

A; Reference number: A33510; MUID : 8 92555 08 ; PMID:2722863 
A; Accession: A33510 
A; Molecule type: DNA 
A; Residues: 1-392 <ISH> 
A;Cross-ref erences : GB:M25985 

R; Ichihara, Y.; Sogawa, K . ; Morohashi, K. ; Fuj ii-Kuriyama, Y. ; Takahashi K 
Eur. J. Biochem. 161, 7-12, 1986 

A; Title: Nucleotide sequence of a nearly full-length cDNA coding for pepsinog 
of rat gastric mucosa. 

A/Reference number: A24608; MUID : 87054020 ; PMID:3780741 
A; Access ion : A24 608 
A; Molecule type: mRNA 
A; Residues: 1-392 <ICH> 

A;Cross-references: GB:X04644; NID:g56880; PIDN : CAA28305 . 1 ; PID*g56881 
R;Ichihara, Y. ; Sogawa, K. ; Takahashi, K. 
J. Biochem. 98, 483-492, 1985 

A; Title: Isolation of human, swine, and rat prepepsinogens and calf 
preprochymosin, and determination of the primary structures of their NH2- 
terminal signal sequences . 

A/Reference number: A22434; MUID : 8 6059312 ; PMID:2415509 
A/Accession: C22434 
A;Molecule type: protein 

A/Residues: 1-19, » X », 21-23 , ' X 25-2 9 <IC2> 

R;Arai, K.M. ; Muto, N. ; Tani, S.; Akahane, K. 

Biochim. Biophys. Acta 788, 256-261, 1984 

A;Title: The N-terminal sequence of rat pepsinogen. 

A; Reference number: A05145; MUID : 8 42 57 697 ; PMID: 6743670 

A;Accession : A05145 

A;Molecule type: protein 

A;Residues: 17-30 , ' Q ', 32-102 , 'A 1 , 1 04-108 , ' L ' , 1 1 0-1 12 <ARA> 
A; Experimental source: Wistar strain 
R;Ichihara, Y. ; Sogawa, K. ; Takahashi, K. 
J. Biochem. 92, 603-606, 1982 

A; Title: Rat gastric prepepsinogen : in vitro synthesis and partial amino- 
terminal signal sequence. 

A;Reference number: A61298; MUID : 83030750 ; PMID: 6182139 
A/Accession: A61298 
A;Molecule type: protein 

A; Residues: 1, 'XX ',4-6, 'x',8-9, 'X',11, 'X', 13-14, 'XXX', 18- 
19, 'X',21, 'X',23, 'XX', 26, 'X' <IC3> 

C; Comment: This enzyme has more restricted specificity than pepsin A. It is th 
ma^or form of pepsinogen in rat gastric mucosa. 
C; Genetics : 

A;Introns: 20/2; 73/3; 113/1; 152/3; 219/2; 259/2; 309/3* 342/3 

A;Note: there are at least two very similar genes for gastricsin in rat 

C; Superf amily : pepsin 

C; Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion; 

stomach ^ 

F;l-16/Domain: signal sequence #status experimental <SIG> 



F; 17-392/Product : pepsinogen #status experimental <MAT> 
F;17-62/Domain: activation peptide #status experimental <ACT> 
F; 94, 280/ Active site: Asp #status predicted 

F;107-112, 270-275, 314-347/Disulfide bonds: #status predicted 

Query Match 14.8%; Score 313; DB 1 ; Length 392; 

Best Local Similarity 29.5%; Pred. No. 3.9e-17; 

Matches 105; Conservative 56; Mismatches 139; Indels 56; Gaps 16; 

30 YYLEMLI GTPPQKLQI LVDTGS SNFAV AGTPHSYIDTYFDTERSSTYRSKGF 81 

I : I : M I I I I : I I I I I I I I III: | ■ • I I I I ' ' I 

Db 76 YFGEISIGTPPQNFLVTjFDTGSSNLWVSSVYCQSELACTTHA RFNPSKSSTYYTEGQ 131 

QY 82 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

: : : I I I I I I I I : | : | | in || 

Db 132 TFSLQYGTGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 184 

QY 137 LAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS — GTNGGSLVLGGIEP 194 

IN I : I | : : : | : | | | ( | : | | | • | | | • • 

Db 185 LAYPGLS SGGATTALQGMLGE GALSQPLFGVYL— GSQQGSNGGQIVFGGVDK 235 

Qy 195 SLYKGDIW YTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 254 

= I I I : I : I : : I | : | | | || |: | : I I I : I I : I I : I : 

236 NLYTGEITWVPVTQELYWQITIDDFLIGDQAS GW-CSSQGC-QGIVDTGTSLLVMPAQYL 293 

Qy 255 DAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDENSSRSFRITIL 314 

: : : : I : : I : : | : | I : I I 
Db 294 SELLQTIGAQE— GEYGEYF VSCDSVSS LPTLSFVL NGVQFPLS 335 

QY 315 PQLY- IQPMMGAGLNYECYRFGI S PSTNALVT GATVMEGFYVI FDRAQKRVGFAAS 369 

I'll : I : : | : : | | | | : I I I I 

Db 336 PSSYIIQEDNFCMVGLESISLTSESGQPLWILGDVFLRSYYAIFDMGNNKVGLATS 391 



Db 



RESULT 11 
A41443 

pepsin (EC 3.4.23.-) precursor, embryonic - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 05-Jun-1992 #sequence_revision 05-Jun-1992 #text change 21-Jul-2000 
C;Accession: A41443 ~" 

R;Hayashi, K. ; Agata, K.; Mochii, M. ; Yasugi, S.; Eguchi, G. ; Mizuno, T. 
J. Biochem. 103, 290-296, 1988 

A; Title: Molecular cloning and the nucleotide sequence of cDNA for embryonic 

chicken pepsinogen: phylogenetic relationship with prochymosin. 

A; Reference number: A41443; MUID : 88227903; PMID: 3131317 

A;Accession: A41443 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-383 <HAY> 

A; Cross-references: GB:D00215; NID: g2760810; PIDN : BAA00153 . 1 ; PID:g222853 
C; Super family : pepsin 

C; Keywords: aspartic proteinase; hydrolase; protein digestion 

Query Match 14.5%; Score 306.5; DB 2; Length 383; 

Best Local Similarity 26.9%; Pred. No. 1.2e-16; 

Matches 96; Conservative 62; Mismatches 128; Indels 71; Gaps 12; 



Qy 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAG TPHSYIDTYFDTERSSTYRSKGFDVTV 85 

M ••MUM :: I I I I I I I : | | : : | | | | : | | 

Db 76 YYGTISIGTPPQDFTWFDTGSSNLWVPSVSCTSPACQSHQMFNPSQSSTYKSTGQNLSI 135 

Qy 86 KYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKP 145 

' 1 I M I I I : : I : I : : | I | | : : I : - I I I I I I • I I 
Db 136 HYGTGDMEGTVGCDTVTVASLMDTNQLFGLST-SEPGQFFV- YVKFDGILGLGYPSLA 191 

QY 146 S S SLETFFDSLVTQANI - PNVFSMQMCGAGLPVAGSGTNGGS LVLGGI EPS LYKGDIWYT 204 

: I I : : I : : : | : | | : : : I : I M I : I : I I • 

Db 192 ADGITPVFDNMVNESLLEQNLFSVYLSREPM GSMWFGGIDES YFTGSINWI 243 

QY 205 PI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKAI VDSGTTLLRLPQKVFDAVVEAVARA. 264 

' : i I : I I : : : | : : : I I : | : | | : | : | : . t i 

Db 244 PVSYQGYWQISMDSIIVNKQEIACS SGCQAI IDTGTSLVAGPASDINDIQSAVG-- 297 

QY 2 65 SLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMG 32 4 

1 1 I I I : I : : : | 

ANQNTYGEY SVNCSHILAMPDWF— VIG 324 



Db 298 

Qy 325 AGLNY 



Db 



ECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFA 367 
1 ' ' ' : | : : : | : : | | | | | | | | | | 

325 -GIQ YPVPALAYTEQNGQGTCMS S FQNS SADLWI LGDVFI RVYYS I FDRANNRVGLA 380 



RESULT 12 
KHMSD 

cathepsin D (EC 3.4.23.5) precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 31-Dec-1991 #sequence_revision 31-Dec-1991 #text change 18-Jun-1999 
C;Accession: 148278; S14704; S12587 ~ 
R;Hetman, M. ; Perschl, A.; Saftig, P.; Von Figura, K. ; Peters, C. 
DNA Cell Biol. 13, 419-427, 1994 

A;Title: Mouse cathepsin D gene: molecular organization, characterization of the 
promoter, and chromosomal localization. 

A;Reference number: 148278; MUID : 9428 0 622 ; PMID : 8 011168 
A; Access ion: 14 8278 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-410 <RES> 

A;Cross-references: EMBL:X68378; NID:g50302; PIDN: CAA4 8453 . 1 ; PID'q817945 
R;Diedrich, J.F.; Staskus, K.A. ; Retzel, E.F.; Haase, A.T. 
Nucleic Acids Res. 18, 7184, 1990 

A; Title: Nucleotide sequence of a cDNA encoding mouse cathepsin D 
A; Reference number: S14704; MUID : 91088345 ; PMID:2263503 
A;Accession: S14704 
A;Molecule type: mRNA 
A; Residues: 1-410 <DIE> 

A;Cross-references: EMBL:X53337; NID:g50300; PIDN : CAA37 423 . 1 ; PID-a50301 
R;Grusby, M.J.; Mitchell, S.C.; Glimcher, L.H. 
Nucleic Acids Res. 18, 4008, 1990 

A;Title: Molecular cloning of mouse cathepsin D. 
A;Reference number: S12587; MUID: 90326544 ; PMID:2374732 
A; Accession: S12 58 7 
A;Molecule type: mRNA 
A; Residues: 1-410 <GRU> 

A;Cross-references: EMBL:X52886; NID:g50298; PIDN : CAA37 067 . 1 ; PID:g50299 



C; Genetics : 

A;Introns: 23/2; 76/3; 118/1; 157/3; 233/2; 274/2; 322/3; 355/3 
C; Function: 

A; Description: limited specificity endopeptidase 
A; Pathway: intracellular protein degradation 
C; Super f amily : pepsin 

C;Keywords: aspartic proteinase; glycoprotein; hydrolase; lysosome; protein 
degradation 

F; 1-2 0/ Domain : signal sequence #status predicted <SIG> 
F;21-64/Domain: propeptide #status predicted <PRO> 

F;65-410/Product: cathepsin D, single-chain form Status predicted <MAT> 
F; 91-160, 110-117,284-288, 327- 3 64 /Disulfide bonds: #status predicted 
F; 97, 293/Active site: Asp #status predicted 

F;134,261/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 14.5%; Score 306.5; DB 1; Length 410; 

Best Local Sxmilarity 27.5%; Pred. No. 1.4e-16; 

Matches 103; Conservative 64; Mismatches 123; Indels 85; Gaps 15; 

Q y 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 83 

M : : I I I I I | : : | | | f | | | . | . . . . . , , , , , 

Db 79 YYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSF 138 

Q y 84 TVKYTQGSWTGEVGEDLVTIPKGFNTSFLVNIAT IFESENFFLPGI KWNGIL 135 

: I I I : I : : : I | : : | : | | | | | | | | | • • | I I 

Db 139 DIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIF-GEATKQPGIVFVAAKFDGIL 197 

Qy 136 G ^ YAT] ^ PS SSLETFFDSLVTQANI-PWFSMQMCGAGLPVAGSGTNGGSLVLGGIEP 194 

I : I : : : : : I I : I : I : I : I I | | | | | : | | | - 

Db 198 GMGYPHIS— VNNVLPVFDNLMQQKLVDKNIFSFY LNRDPEGQPGGELMLGGTDS 250 

QY 195 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVTDSGTTLLRLPQKVF 254 

1 1 : : 1 : : I : f : : : I I : I : | | | : : I I I I : I I : | | | : 

Db 251 K YYHGELSYLNVTRKAYWQVHMDQLEVGNE-LTL-CK— GGCEAIVDTGTSLLVGPVEEV 306 

255 DAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 314 
Db 307 KELQKAIGAVPLI QGEYMI PCEKVSSL 333 

QY 315 PQLYIQPMMGAGLNYEC YRFGIS P S TNALVI GATVMEG 352 

I : I : : : I I I | | | : | , ( . . , 

Db 334 PTVYLK ^G-GKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGS 390 

QY 353 FYVI FDRAQKRVGFA 367 

•I : I I I Mil! 
Db 3 91 YYTVFDRDNNRVGFA 4 05 



RESULT 13 
KHRTD 

cathepsin D (EC 3.4.23.5) precursor - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 31-Dec-1991 #sequence_revision 31-Dec-1991 #text change 18-Jun-1999 
C;Accession: S13111; C31918; JQ1177; PQ0222 
R;Birch, N.P.; Loh, Y.P. 

Nucleic Acids Res. 18, 6445-6446, 1990 

A; Title: Cloning, sequence and expression of rat cathepsin D. 



A; Reference number: S13111; MUID: 91057150; PMID:2243802 
A;Accession: S13111 
A; Molecule type : mRNA 
A; Residues: 1-407 <BIR> 

A;Cross-references: EMBL:X54467; NID:g55881; PIDN : CAA38349 . 1 ; PID:g55882 
R;Yonezawa, S.; Takahashi, T.; Wang, X.; Wong, R.N.S.; Hartsuck, J. A. ; Tang, 
J. Biol. Chem. 263, 16504-16511, 1988 

A;Title: Structures at the proteolytic processing region of cathepsin D. 

A;Reference number: A92681; MUID : 8 9034 127 ; PMID:3182800 

A; Access ion : C31918 

A;Molecule type: protein 

A;Residues: 134-162 T 1 , 164-170 <YON> 

R;Fujita, H.; Tanaka, Y. ; Noguchi, Y. ; Kono, A.; Himeno, M. ; Kato, K. 
Biochem. Biophys . Res. Commun. 179, 190-196, 1991 

A;Title: Isolation and sequencing of a cDNA clone encoding rat liver lysosoma 
cathepsin D and the structure of three forms of mature enzymes. 
A; Reference number: JQ1177; MUID: 91354249; PMID: 1883350 
A; Accession : JQ1177 
A;Molecule type: mRNA 

A;Residues: 1-14 , ' A' , 16-2 04 , ' N ' , 2 06-2 61 , ' N ' , 2 63-4 07 <FUJ> 
A; Accession: PQ0222 
A;Molecule type: protein 

A;Residues: 65-74 ; 1 18-127 ; 1 65-174 <FU2> 
A; Experimental source: liver 

C; Comment: Cathepsin D in rat liver lysosome occurs as a mixture of both a 
single chain form and two types of two chain forms. 
C; Function : 

A; Description: limited specificity endopeptidase 
A; Pathway: intracellular protein degradation 
C; Superf amily : pepsin 

C; Keywords: aspartic proteinase; glycoprotein; hydrolase; lysosome; protein 
degradation 

F; 1-2 0/ Domain : signal sequence #status predicted <SIG> 
F;21-64/Domain: propeptide #status predicted <PRO> 

F;65-407/Product: cathepsin D, 43K single-chain form #status predicted <MAT> 
F;65-164/Product: (or 65-165) cathepsin D 12K light chain Status predicted 
<MA2> 

F;65-117/Product: cathepsin D 9K light chain #status predicted <MA4> 

F;118-407/Product: cathepsin D 34K heavy chain #status predicted <MA5> 

F;165-407/Product: (or 166-407) cathepsin D 30K heavy chain Status predicted 
<MA3> 

F;91-160, 110-117, 281-285, 324-361/Disulf ide bonds: Status predicted 
F;97,290/Active site: Asp #status predicted 

F;134,258/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 14.4%; Score 305; DB 1; Length 407; 

Best Local Similarity 29.4%; Pred. No. 1.8e-16; 

Matches 104; Conservative 61; Mismatches 143; Indels 46; Gaps 15 

Qy 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 83 

I I I : I I I I I I : : I I! I I I I : | : : : : : | | | [ | 

Db 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSSTYVPCNGTSF 138 

Q y 84 TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI FESENFFLPGI KWNGILGLA 138 

: I I I : I : : : I I : : | : : || | | | s | : : I I I I : 

Db 139 DIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIF-GEATKQPGVVFIAAKFDGILGMG 197 



Qy 


139 


Db 


198 


Qy 


198 


Db 


251 


Qy 


258 


Db 


304 


Qy 


318 


Db 


349 



M : I : I : I : II | : | | | | : | | | 



— - - — u-'au v t/uui x j_ij_jr\_Lj it 

1 : : I : : I : I : : : I I : I : I | | : : M I I : I I : I I | 



I I 



I I : I : I : | : 



11 : N I II: I : : I : : I : | | | | ( | | 



RESULT 14 
REMSK 

renin (EC 3.4.23.15) precursor, renal - mouse 

N;Alternate names: angiotensin-f orming enzyme; angiotensinogenase; renin 1 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Jun-1987 #sequence_revision 30-Jun-1987 #text change 18-Jun-1999 
C;Accession: A00989; S07636; A22766; A22058; 157576; A05137; JH0083 
R;Holm, I.; Olio, R. ; Panthier, J. J.; Rougeon, F. 
EMBO J. 3, 557-562, 1984 

A; Title: Evolution of aspartyl proteases by gene duplication: the mouse renin 

gene is organized in two homologous clusters of four exons . 

A;Reference number: A00989; MUID : 84 1 82525 ; PMID: 6370686 

A;Accession: A00989 

A; Molecule type: DNA 

A; Residues: 1-402 <HOL> 

A; Cross-references : EMBL:X00850 

R;Kim, W.S.; Murakami, K. ; Nakayama, K. 

Nucleic Acids Res. 17, 9480, 1989 

A; Title: Nucleotide sequence of a cDNA coding for mouse Renl preprorenin 
A;Reference number: S07636; MUID : 90067 953 ; PMID:2685761 
A; Accession: S 07 636 
A;Molecule type: mRNA 
A; Residues: 1-4 02 <KIM> 

A/Cross-references: EMBL : X16642 ; NID:g53930; PIDN : CAA34 636 . 1 ; PID-g53931 
R;Mullins, J.J.; Burt, D.W.; Windass, J.D.; McTurk, P.; George, H\ ; Brammar, 
W.J. 

EMBO J. 1, 1461-1466, 1982 

A; Title: Molecular cloning of two distinct renin genes from the DBA/2 mouse 
A; Reference number: A90968; MUID : 842 07 899 ; PMID: 6327270 
A; Accession : A22766 
A; Molecule type: mRNA 
A;Residues: 2 69-314 , » D 1 , 3 1 6 <MUL> 

R;Panthier, J.J.; Dreyfus, M. ; Roux, D.T.L.; Rougeon, F. 
Proc. Natl. Acad. Sci. U.S.A. 81, 5489-5493, 1984 

A;Title: Mouse kidney and submaxillary gland renin genes differ in their 5' 
putative regulatory sequences. 

A; Reference number: A22058; MUID : 8 42 98 161 ; PMID: 6089205 
A; Accession : A22 05 8 
A;Molecule type: DNA 
A; Residues: 1-30 <PAN> 



R;Field, L.J.; Philbrick, W.M. ; Howies, P.N.; Dickinson, D.P.; McGowan RA • 
Gross , K. W. r ' * * 

Mol. Cell. Biol. 4, 2321-2331, 1984 

A;Title: Expression of tissue-specific Ren-1 and Ren-2 genes of mice: 
Comparative analysis of 5' -proximal flanking regions. 
A; Reference number: 157576; MUID: 85085936; PMID: 6392850 
A;Accession: 157576 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues : 1-31 <RES> 

A;Cross-references: GB:K02800; NID:g200689; P I DN : AAA4 0044.1; PID-g200690 
C; Comment: The only known function of renal renin is to release angiotensin I 
from angiotensinogen in the plasma, initiating a cascade of reactions that 
produces an elevation of blood pressure and increased sodium retention by the 
kidney. J 

C;Comment: Renal renin is synthesized by the juxtaglomerular cells of the kidney 
in response to decreased blood pressure and sodium concentration. 
C; Genetics : 

A; Gene: Ren-1 

A;Introns: 31/2; 81/3; 123/1; 162/3; 228/2; 268/2; 316/3; 349/3 
C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; blood pressure control; glycoprotein- 
hydrolase; kidney; plasma 

F; 1-21/Domain: signal sequence #status predicted <SIG> 
F;22-64/Domain: propeptide #status predicted <PRO> 
F;65-402/Product: renin #status predicted <MAT> 

F;69,139,320/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;102,287/Active site: Asp #status predicted 

Query Match 14.4%; Score 304; DB1; Length 402; 

Best Local Similarity 29.0%; Pred. No. 2.1e-16; 

Matches 108; Conservative 60; Mismatches 152; Indels 52; Gaps 17; 

Qy 6 LAS PAGAANF1AMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS Y 63 

n 111 1 : 1 11 I I I : I I I I I I : : : i I I I : I | | | 

Db 70 LTSPWLTNYL NTQ YYGEI GI GTPPQTFKVI FDTGSANLWVPSTKCS RLY 119 

Qy 64 1 DT YFDTERS STYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIAT I F 119 

• : : : : M : I I | f : | | | | . . | | , . , . , 

120 LACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTV-GGITVTQTFGEVTEL 178 

Qy 120 ESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVA 178 

If I : : I : I I : : I : : | | : : : | : | | | : 

Db 179 PLIPFML— AKFDGVLGMGFP— AQAVGGVTPVFDHILSQGVLKEEVFSVYY NR 228 

Qy 179 GSGTNGGSLVL GGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 238 

'I 'I Mill : I I : | : | | : : || : . - | j | , , , 

Db 229 GSHLLGGEWLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVG — SSTLLCEEGCA V 284 

Qy 239 ^^DSGTTLLRLPQKVFDAVVEAV-ARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKIS 297 

: I I : I : : : | : : : | : | : | | : . , , . ( , , , 

Db 285 WDTGSSFI SAPTSSLKLIMQALGAKEKRIEEY WNC SQVP TLPDIS 331 

QY 298 I YLRDENS SRS FRI T I LPQLYIQPMMGAGL-NYECYRFGI S PSTNAL- VI GATVMEGFYV 355 

I | : : : : : | | : I I I : | : | | | : | | 

Db 332 FDL GGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVWVLGATFIRKFYT 387 



Db 



Qy 35 6 I FDRAQKRVGFA 367 

Ml I : I I I 
38 8 EFDRHNNRIGFA 399 



RESULT 15 
JC7574 

pepsinogen A - African clawed frog 

C; Species: Xenopus laevis (African clawed frog) 

C;Date: 3.0-Jun-2001 #sequence_revision 30-Jun-2001 #text change 03-Auq-2001 
C;Accession: JC7574; PC7119 ~ 
R;Ikuzawa, M. ; Inokuchi, T.; Kobayashi, K. ; Yasumasu, S. 
J. Biochem. 129, 147-153, 2001 

A; Title: Amphibian pepsinogens: Purification and characterization of Xenopus 

pepsinogens, and molecular cloning of Xenopus and bullfrog pepsinogens 

A; Reference number: JC7573; MUID : 2 1064 922 ; PMID: 11134969 

A; Contents: Stomach 

A; Access ion : JC7 57 4 

A;Molecule type: mRNA 

A; Residues: 1-384 <IKU> 

A; Cross- references : DDBJ: AB04 538 0 

A;Accession: PC7119 

A;Molecule type: protein 

A;Residues: 16-35;57-76 <IK2> 

C; Comment: This protein is a zymogen for gastric aspartic proteinase, with 

pepsin-like activity. 

C; Genetics : 

A; Gene : PgA 

C; Superf amily : pepsin 

C; Keywords: stomach; zymogen 

Query Match 14.3%; Score 302; DB 2; Length 384; 

Best Local Similarity 27.5%; Pred. No. 2.8e-16; 

Matches 103; Conservative 71; Mismatches 144; Indels 56; Gaps 17; 

Q y 4 PALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSY 63 

111 = s I I I I : I I I I I I : : : | | | | : | | | | 
Db 54 PT LAQ S SAETLQNYMDI EYYGTI S I GTPPQEFTVI FDTGSANLWV PSVY 102 

Q ^ 64 IDTY FDTERSSTYRSKGFDVT VKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIA 116 

: I : : : I | | : : : I : : : I | | : | | : | | | : | | : 

Db 103 CS SQACSNHNRFNPQQSSTFQATNTPVSIQYGTGSMSGFLGYD--TLQVG-NIQISNQMF 159 



Db 



Q ^ 117 TIFESE-NFFLPGIKWNGILGLAYATIAXPSSSLETFFDSLVTQANIP-NVFSMQMCGAG 174 

: 'I' II : : I I II I I : : : I I ! I I : : : | || I : I I : : 
160 GLSESEPGSFLYYSPFDGILGLAFPSIA-- SSQATPVFDNMWSQGLIPQNLFS VYL 213 

Q y 175 LPVAGSG ™GGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 234 

1 1 ' :: II:: I I I : : I: I I : I I : : I | | : | : 
Db 214 SSDGQTGSYVLFGGVDNSYYSGSLNWVPLTAETYWQITLDSVSINGQV — IACSQ — 266 

Qy 235 ADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 294 

: : I M I : I I : I : | | : : | | : | : | : : | | | , 

Db 267 SCQAIVDTGTSLMTGPSTPI-ANIQNYIGAS— QDSNGQYV— INCNNI SNMPTIVF- 318 

QY 295 KISIYLRDENSSRSFRITILPQLYI-QPMMGAGLNYECYRFGISPSTNALVIGATVMEGF 353 

: : : | | : | | : : : | . . . | . . 



Db 319 



-TINGVQ YPLS P SAYVRQNQQGCS SGFQAMNLPTN- SGDLWI LGDVFI RQ Y 367 



Qy 354 YVIFDRAQKRVGFA 367 

: : I I II I I 
Db 36 8 FTVFDRANNYVAIA 381 



Search completed: March 4, 2004, 15:41:00 
Job time : 23.0277 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on 



March A, 2004, 15:39:01 



Search time 45.3511 Seconds 
(without alignments) 

1890.324 Million cell updates/sec 



Title: 

Perfect score 
Sequence : 

Scoring table 



Searched : 



US-0 9-668-3 14C-2__COPY_63_4 68 
2113 

1 AL E P ALAS P AGAAN FLAMVD STEDVASNCVPAQSLSEPIL 406 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



809742 seqs, 211153259 residues 



Total number of hits satisfying chosen parameter: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



809742 



Post-processing 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : 



Published Applications AA: * 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 



/ cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB .pep : * 
/ cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 
/ cgn2_6/p todata/2 /pubpaa/US 06_NEW_PUB. pep: * 
/ cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep : * 
/ cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep : * 
/ cgn2_6/ ptodata/ 2 /pubpaa/ PCTUS_PUBCOMB . pep : * 
/ cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep : * 
/ cgn2_6/p todata/2 /pubpaa/US 08_PUBCOMB. pep: * 
/ cgn2_6/ptodata/ 2/pubpaa/US09A_PUBCOMB . pep : * 
/cgn2_6/ptodata/2 /pubpaa/US 0 9B__PUBCOMB. pep : * 
/ cgn2_6/ptodata/2/pubpaa/US09C__PUBCOMB.pep : * 
/ cgn2_6/ ptodata/2 / pubpaa/US 0 9_NEW_PUB . pep : * 

/ cgn2_6/ptodata/2/pubpaa/USl0A_PUBCOMB.pep : * 
/ cgn2_6/ptodata/2 /pubpaa/US 10B_PUBCOMB. pep : * 
/ cgn2__6/ptodata/2 /pubpaa/US 1 0C_PUBCOMB . pep : * 
/ cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep : * 
/cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
/ cgn2_6/pt odat a/2 /pubpaa/US 60_PUBCOMB. pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 

US- 10- 10 6- 69 8- 63 66 

; Sequence 6366, Application US/10106698 

; Publication No. US2 00301 0 969 0A1 

; GENERAL INFORMATION: 

; APPLICANT: Ruben et al. 



; TITLE OF INVENTION: Colon and Colon Cancer Associated Polynucleotides and 
Polypeptides 

FILE REFERENCE: PA005P1 

CURRENT APPLICATION NUMBER: US/10/106, 698 
CURRENT FILING DATE: 2002-03-27 
PRIOR APPLICATION NUMBER: PCT/US 00/2 6524 
PRIOR FILING DATE: 2000-09-28 
PRIOR APPLICATION NUMBER: US 60/157,137 
PRIOR FILING DATE: 1999-09-29 
PRIOR APPLICATION NUMBER: US 60/163,280 
PRIOR FILING DATE: 1999-11-03 
NUMBER OF SEQ ID NOS : 8564 
SOFTWARE: PatentlnVer. 3.0 
SEQ ID NO 6366 
LENGTH: 4 81 
TYPE: PRT 

ORGANISM: Homo sapiens 
US- 10- 106- 69 8- 63 66 

Query Match 100.0%; Score 2113; DB 14; Length 481; 

Best Local Similarity 100.0%; Pred. No. 3.9e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

QY 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

1 > 1 > I I I II I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | , 

Db 26 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 85 

Q y 61 HSYID TYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 

I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I II I I | | | | | | | | | | | | | | | | 

Db 86 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 145 

QY 121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

" I I I I I I I I I I I I I I I M I II I I I I I I | | | | | | | | | | | | M I II I I M I 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 2 05 



Db 146 
Qy 181 
Db 206 



GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 
1 1 1 I I I I I I I I I I I I I I I I | | | | | | | I I I I I I II I I I I I I I I I I I I I I I I I I II I I I | | | 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 2 65 

Q y 241 DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 300 

1 1 1 ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II II 

Db 266 DSGTTLLRLPQKVFDAVVE1AVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 325 

Qy 301 RDENS SRSFRITILPQLYIQPMMGAGLNYECYRFGI S PSTNALVIGATVMEGFYVI FDRA 360 

1 ' M I I I I I | | | | I I I I I I I I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

326 RDENS S RS FRIT I LPQLYIQPMMGAGLNYECYRFGIS PSTNALVIGATVMEGFYVI FDRA 385 

Qy 36 1 QKRVGFAASPCAEIAGAAVSEI SGPFSTEDVASNCVPAQSLSEPI L 406 

I I I I I I I I M I I I I II I | | | | | | | | | | | | | | | | | | | || | | | | | | | | 
Db 38 6 QKRVG FAAS P CAE I AGAAVS EI S G P FS T EDVAS NCVPAQ S L S E P I L 431 



Db 



RESULT 2 
US-09-794-927-2 

; Sequence 2, Application US/09794927 
; Patent No. US20010016324A1 
; GENERAL INFORMATION: 



AND 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 
INVENTION : ALZHEIMER 



S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 



TITLE OF INVENTION: USES 

TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280FG 

CURRENT APPLICATION NUMBER: US/09/794,927 

CURRENT FILING DATE: 2001-02-27 



PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE; 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
FILING DATE: 



NUMBER 



1999-10-13 



NUMBER 
1999 

NUMBER 
1999 

NUMBER 



NUMBER 



09/416, 901 



60/155, 493 
09-23 

09/404, 133 
09-23 

PCT/US99/20881 



1999-09-23 



60/101,594 



1998-09-24 



PRIOR 

NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-927-2 



0 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 



0; 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
1 ' I I 1 1 1 1 ! 1 1 1 1 1 1 I I | || | | | | | | | | | | | | | || | | | | , | , | , , | 

ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
1 1 1 ' 1 ! 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 1 II || | | | || | | | | | | | | | | | | | | | | | , , | | , | 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLWIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 "> 1 1 I 1 1 1 1 1 1 1 1 f 1 1 1 I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 " 1 1 M I 1 1 1 1 | | | | | I | | | | | | | | | | | | | | , | | 1 1 1 1 1 II 1 1 1 1 1 1 1 I I 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
1 1 11 1 ' i 1 1 1 1 1 M 1 1 1 1 1 1 1 I I 1 | | | | | | | | | | | | | | | , | | | | | | || | || | , 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTN7VLVIGATVMEGFYVIFDRA 

1 1 1 > M 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I | I | 1 1 1 1 1 1 1 1 | | | | Ml 

RDENSSRSFRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVIGATVMEGFYVI FDRA 


360 


Db 


363 


422 



361 QKRVGFAAS PCAE I AGAAVS E I S GP FS T EDVASN CVPAQ S L S E P I L 406 
M I I I I I I I I I I I I I I I I I I I M | | | | M II I I M I I I I I | | | | | | 
Db 423 QKRVGFAAS PCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



RESULT 3 
US-09-795-847-2 

Sequence 2, Application US/09795847 
Patent No. US2 00100 182 08A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 

Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND ' 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280DE 
CURRENT APPLICATION NUMBER: US/09/795, 847 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-795-847-2 

Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

QY 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

I ' I I I I I I I I I I I I I | | | | | I I I I I I I I I | | | | | | I I I I I I I I I I I I I I I I I I 

Db 63 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

Q y 61 HS YI DTYFDTERS ST YRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI FE 120 

1 1 ' I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I II I I I I I I I I I I I I 

Db 123 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 



121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

1 1 1 ' I I I I I I I I I I I I I | | | | | | I I I I I I I I I I M I I I I I I I I 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 



Db 183 



Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA.lv 

MINIUM 1 M 1 1 I 1 1 1 I 1 I I I | f ! I 1 M 1 1 1 1 I | 1 f | | | M | | | I M 1 I 1 I! 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA.lv 


240 


Do 


24 3 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 " 1 f 1 1 1 1 If 1 1 1 I 1 1 1 1 1 1 I I 1 1 1 1 1 1 | | i M 1 1 1 1 | | | | | | | 1 1 1 1 I 1 1 1 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Dd 


303 


362 


Qy 


301 


RDENSSRS FRITILPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FDRA 

I | | | M | | | | | | | | | 1 | 1 | 1 1 | | l 1 1 1 1 1 | 1 t 1 1 1 1 1 1 1 1 it 1 1 | , tiiii(iii 

1 ' 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 f ' > I I I I I I I | | | | f | 1 II f 1 1 1 1 1 1 1 1 I | I I | 

RDENS S RS FRI T I LPQLYI QPMMGAGLNYEC YRFGI S P S TNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 ' f 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | | 
QKRVGFAAS P CAE I AGAAVS E I S GP FS TEDVASNCVPAQ S L S E P I L 4 68 




Db 


423 





APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



RESULT 4 
US-09-794-743-2 

Sequence 2, Application US/09794743 
Patent No. US2 001 002 1391A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 

Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND ' 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280BC 
CURRENT APPLICATION NUMBER: US/ 0 9/ 7 94 , 7 4 3 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US 9 9/2 0 8 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-743-2 

Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 



Db 


63 


<> 1 1 1 1 1 f 1 i f 1 1 1 f 1 1 1 1 1 1 1 ! 1 1 1 1 I II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | M 1 

ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


122 




61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

< ' M 1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 | | | | | | | | 1 1 1 1 1 1 1 1 | | 1 1 M I 1 1 1 M 1 1 1 f 1 | | | | | | 1 1 
HS YIDTYFDTERS STYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FE 


120 


Db 


123 


182 


Qy 


121 


S ENFFL PGI KWNGI LGLAYATLAKP S S S LET FFD S LVTQANI PNVFSMQMCGAGLPVAGS 

I' 1 1 I ! I M 1 I 1 1 1 1 1 I 1 1 I | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 I I I I I | | | I | | 1 1 

SENFFLPGI KWNGI LGLAYATLAKPSSS LET FFDS LVTQANI PNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

M 1 ' M 1 1 1 1 1 1 1 II 1 1 1 1 1 | | | 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 ' 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 M 
DSGTTLLRLPQWFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 

1 1 1 ( 1 1 1 1 1 1 I 1 1 | | 1 1 I 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 | i i t i . i i i i ) i 

1 1 ' 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M | | | | | | M | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

RDENS S RS FRITI LPQLYIQPMMGAGLN YECYRFGI S PSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEI SGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 1 ' 1 1 1 1 1 M 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I j | | 

QKRVGFAASPCAEIAGAAVSEI SGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





RESULT 5 
US-09-794-748-2 

Sequence 2, Application US/09794748 
Patent No. US2 002 0037315A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND ' 

TITLE OF INVENTION: USES 

TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280JL 

CURRENT APPLICATION NUMBER: US/ 09/7 94 , 74 8 

CURRENT FILING DATE: 2001-02-27 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patent In Ver. 2.0 



; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-748-2 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILvT)TGSSNFAVAGTP 60 

1 1 M M 1 I I I I I I M I I i I I I I I I I I | | | | M I I I | | I | | | | | | M | I I I I I | | | | M I I 
Db 63 ALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGT P 122 

Q y 61 HS YI DTYFDTERSSTYRSKGFD VTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FE 120 

' 1 ' I I I I I i I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I | | f | f | 1 | | | | | | | | | 
123 HS YIDTYFDTERS STYRSKGFD VTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FE 182 



Db 



Qy 121 SENFFLPGIKWNGILGIAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

1 ' 1 ' I I I I I I I I I I I I I I I I | | | | | M I II I I I I I I I I I f | | | | | | | | | | I I I I I | | | I I 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 240 

1 1 ' I I I I I I I I I I I I I I I I I I I I | | | | | | | | I I II I I I I I I I I I I I I I I I I | | | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 302 

DSGTTLLRLPQKVFDAVVKAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 300 
I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | ,| | | M I I I I I I I I I I 

DSGTTLLRLPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI YL 3 62 

Q y 301 RDENSSR SFRITILPQLYIQPMMGAGLNYECYRFGISPSTN7VLVIGATVMEGFYVIFDRA 360 

1 1 1 1 I II I I I I I I I I I I I I I I | | | | | | M I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | 
Db 363 RDENSSRSFR ITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 422 



Db 183 

Qy 181 

Db 243 

Qy 241 

Db 303 



Qy 361 
Db 423 



Q KRVGFAAS P CAE I AGAAVS E I S G P FS T EDVASNCVPAQ S L S EP I L 4 06 

I I I N I I I I I I I I | | | M M I I I | | | | | I I I I I I I | 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 



RESULT 6 
US-09-794-925-2 

Sequence 2, Application US/09794925 
Patent No. US20020064819A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280HI 

CURRENT APPLICATION NUMBER: US/ 09/7 94 , 925 

CURRENT FILING DATE: 2001-02-27 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 



PRIOR APPLICATION NUMBER: 60/155,493 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 09/404,133 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 2 

LENGTH: 518 
; TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-794-925-2 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


l 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTP PQKLQILVDTGS SNFAVAGTP 

M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 1 t 1 ! 1 1 1 t 1 I 1 1 J I 1 1 1 l l l i t i i i i i i i > i . 

' 1 1 1 1 1 1 1 1 1 1 l l l l 1 1 1 1 1 1 II 1 1 1 II 1 1 1 M 1 M II 1 1 I I I I | | | | | | | | | [ | | | | | | 

ALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

>> 1 M 1 M 1 I f 1 1 1 1 M 1 I 1 1 1 1 I f I I I I | | | | M 1 1 1 1 II 1 f 1 I I I | | M 1 1 II 1 f 1 M 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 ' 1 1 M 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | ! 1 1 1 1 1 I I I 1 II 1 1 1 1 1 1 
SENFFLPGI KWNGI LGLAYATLAKPS S S LETFFDS LVTQANI PNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 1 1 1 1 1 1 N 1 II 1 1 1 1 || | | | | | | | | | | | | | | | | | | | | j | | | | | | , | , | | , | , | | , | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQWFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 N 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 | | 1 1 1 1 1 1 1 1 I 1 1 1 1 1 II 1 1 
DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | II 1 1 1 II II 1 1 1 1 1 1 1 1 1 | | | | | | | | | | M 1 1 1 1 1 
RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
I 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I I I f | | | | | | | | | | | | | | | | | | | | | f | | | 

QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 




Db 


423 





RESULT 7 

US-09-215-450-19 

; Sequence 19, Application US/09215450 

; Patent No. US 2 002 006827 8A1 

; GENERAL INFORMATION: 

; APPLICANT: Giese, Klaus 

; APPLICANT: Xin, Hong 

; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES 



FILE REFERENCE: 1451.100 / 210030.447 
; CURRENT APPLICATION NUMBER: US/ 0 9/2 15 , 4 5 0 

CURRENT FILING DATE: 1998-12-17 

NUMBER OF SEQ ID NOS : 27 
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 19 

LENGTH: 518 
; TYPE: PRT 

ORGANISM: human 
US-09-215-450-19 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
'> M M I 1 1 1 1 1 1 f M 1 I 1 1 I 1 I 1 1 1 1 1 M 1 | | | | | | | j | | | M 1 1 1 1 1 1 1 I I I | | | | | | 
ALE P ALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
' 1 1 1 1 M 1 1 1 1 1 1 1 | | M 1 1 1 II 1 I I j I | | | | | f | | | | | | | | | | | f | | | | ! j | | | |, ,, | 
HS YIDTYFDTERSSTYRSKGFDVTVKYTnr^^TATTr T^/r"T?nT \ft , t'd itmt-> o tt-t t nrT^mrnn 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

' 1 1 1 ' 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 1 1 1 1 1 II 1 N 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I | | | | | | || | | | | | | | | || | | | | | , | 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI YL 

1 ' 1 " 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 II 1 1 I I I | || | || | | | || | | | | | | | | 

DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQ PMMGAGLN YEC YRFGI S P S TNALVI GATVMEG FYVI FDRA 

' 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 M 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 M I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I I 

QKRVGFAAS P CAEI AGAAVS E I S GP FS T EDVASNCVPAQ S L S E P I L 4 68 




Db 


423 





RESULT 8 
US-09-681-442-2 

Sequence 2, Application US/09681442 
Patent No. US2 002 008 1 634A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES ' 

TITLE OF INVENTION: THEREFOR 



FILE REFERENCE: 28341/6280FG 

CURRENT APPLICATION NUMBER: US/09/681,442 
CURRENT FILING DATE: 2001-04-05 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patent In Ver. 2.0 
; SEQ ID NO 2 
; LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-681-442-2 

Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 
>< f I I I I I I I I M I I II I f I ! I ! I ! M I II I I M I II I I I I I I i I | | | | M | | | | | | | | | 
63 ALEPALAS PAGAANFIlAMVDNLQGDSGRGYYLEMLI GTP PQKLQI LVDTGS SNFAVAGTP 122 

61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 12 0 
1 I I i I I I I I I I I I I I I I I I I I | | | | | f | | | | | | | I M I I I I I I I I I I | | M | I I I I I I | | 
123 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 

121 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

" I I I I I I I M I I I I I I I | | I I I I I I II I I I I I I I I I | | | | | | | | M I I I I I I I I 

183 SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 

181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

1 1 1 ' I I I I I I I I I I I | | | | | | | I I M I I I f I I I | | | | | | | | | ! | | M I I I I I I I | | I I I I 
243 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 302 

241 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQIACWTNSETPWSYFPKISIYL 300 

I I I I I I I I I f M I I I I I II I I I I I I I I I I | | I I | | | | | | | | | | | I | | | | | | | | | | 

303 DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 362 

301 RDENS S RS FRI T I LPQL YI Q PMMGAGLN YEC YRFGI S PS TNALVI GATVMEGFYVT FDRA 360 

1 1 " I I I I I M I I I I I j I | | I I I I I I | | | | | I I I I I I I 

363 RDENS S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVT FDRA 422 

361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

' I I I I I I I I I I I I I I | | | | | | | | | | | | | | j | | | | f | | | | || } | | | | 
423 QKRVGFAAS P CAE I AGAAVS EISGPFSTE DVAS NCVPAQ S LS E P I L 4 68 

RESULT 9 

US- 09-97 8-2 95A- 196 

; Sequence 196, Application US/09978295A 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Patent No. US20020156006A1 
GENERAL INFORMATION: 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ashkenazi, Avi 
Baker Kevin P. 
Botstein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvaroff, Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber, Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godowski, Paul J. 
Grimaldi, J. Christopher 
Gurney, Austin L. 
Hillan, Kenneth J 
Kljavin, Ivar J. 
Kuo, Sophia S. 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas , Daniel 
Williams, P. Mickey 



Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C11 

CURRENT APPLICATION NUMBER: US /09/978, 295A 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 
PRIOR FILING DATE: 1997-11-03 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: 60/066364 
PRIOR FILING DATE: 1997-11-21 
PRIOR APPLICATION NUMBER: 60/077450 
PRIOR FILING DATE: 1998-03-10 
PRIOR APPLICATION NUMBER: 60/077632 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077641 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077649 

1998-03-11 
NUMBER: 60/077791 

1998-03-12 
NUMBER: 60/078004 
1998-03-13 



PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 



PRIOR APPLICATION NUMBER: 60/078886 



PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078910 
; PRIOR FILING DATE: 1998-03-20 
; PRIOR APPLICATION NUMBER: 60/078939 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
; PRIOR APPLICATION NUMBER: 60/079656 
PRIOR FILING DATE: 1998-03-26 
PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079920 
; PRIOR FILING DATE: 1998-03-30 

PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/080105 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080107 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080327 
PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080328 
; PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080333 
; PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080334 
r PRIOR FILING DATE: 1998-04-01 
: PRIOR APPLICATION NUMBER: 60/081070 
f PRIOR FILING DATE: 1998-04-08 
r PRIOR APPLICATION NUMBER: 60/081049 
■ PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 



; PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/082568 

PRIOR FILING DATE: 1998-04-21 
; PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
; PRIOR APPLICATION NUMBER: 60/082804 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796 
; PRIOR FILING DATE: 1998-04-23 
; PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083500 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
; PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
; PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 



PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084600 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084627 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085338 

PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085323 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085582 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
; PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085697 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

• 1 1 1 ' 1 ' 1 N 1 1 1 1 1 1 1 I I I | | | | | | M | | | | | | | || | | | || | | | | | | | | , , , | , 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 1 ' f 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 | M 1 I 1 1 1 1 1 II 1! 1 ! 1 1 1 | 1 i 1 1 1 1 M 1 1 1 1 1 1 1 | 1 I 1 1 1 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 < 1 1 1 1 1 M ! 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 II 1 ! i 1 1 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 > 1 I f i 1 1 1 1 1 1 1 1 1 1 1 1 I 1 | | | | | 1 M 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 1 | | f 1 I 1 1 1 I I I 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 M I 1 1 1 1 1 1 1 1 1 1 I 1 | | | | | | | | 1 I 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 f 1 1 1 M 1 1 1 I 1 1 1 
DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVIFDRA 

>' N 1 M 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 I 1 1 1 1 1 1 M 1 i 1 1 1 1 1 1 I I 1 I | | 

RD ENS SRS FRIT I LPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


363 


422 



Qy 361 QKRVGFAA.SPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

' I I M I 1 ! I I I I II t I I ! I I I I I I I ! I I I | M | | | | | | | | I | | | | | 

Db 423 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 68 



RESULT 10 
US-09-886-143-2 

; Sequence 2, Application US/09886143 

; Patent No. US2 002 0159991A1 

; GENERAL INFORMATION: 

; APPLICANT: Cordell, Barbara 

; APPLICANT: Schimmoller, Frauke 

; APPLICANT: Liu, Yu-Wang 

; APPLICANT: Quon, Diana Horn 

; TITLE OF INVENTION: Modulation of A Levels by 
; TITLE OF INVENTION: Secretase BACE2 
FILE REFERENCE: SCIOS.022A 

CURRENT APPLICATION NUMBER: US/ 09/886, 143 
CURRENT FILING DATE: 2001-06-20 
PRIOR APPLICATION NUMBER: 60/215,729 
PRIOR FILING DATE: 2000-06-28 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 2 
; LENGTH: 518 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-886-143-2 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 11 ' 1 1 1 1 M 1 1 f 1 1 1 1 M 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 M 1 1 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 > M 1 f 1 M 1 1 1 1 1 1 1 1 I I I I | | | | | | | | m 1 I 1 1 1 1 I 1 M 1 1 1 1 1 1 1 I I I I | | | | | | M 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 1 1 I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 M 1 1 1 1 I I I I I | 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

11111 " 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

i 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | | 1 1 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENS SRS ERITI LPQLYI QPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FDRA 

1 1 " 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNAL VI GATVMEGFYVI FDRA 


360 


Db 


363 


422 



Q y 361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

" II I I I II I I I I I I I | | | | f I | | f| f I I I I I I I | I | | | | | | I | | | 
Db 423 QKRVGFTVASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 468 



RESULT 11 

US- 09- 97 8-697-196 

Sequence 196, Application US/09978697 
Patent No. US2 0 02 01692 84A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ashkenazi, Avi 
Baker Kevin P . 
Botstein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvaroff, Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber, Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godows ki , Paul J . 
Grimaldi, J. Christopher 
Gurney, Austin L. 
Hill an, Kenneth J 
Kl j avin, Ivar J . 
Kuo, Sophia S . 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas, Daniel 
Williams, P. Mickey 



Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C27 

CURRENT APPLICATION NUMBER: US/09/978,697 
CURRENT FILING DATE: 2001-10-16 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 
PRIOR FILING DATE: 1997-11-03 
PRIOR APPLICATION NUMBER: 60/065311 

1997-11-13 
NUMBER: 60/066364 

1997- 11-21 
NUMBER: 60/077450 

1998- 03-10 
NUMBER: 60/077632 

1998-03-11 
NUMBER: 60/077641 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 



PRIOR FILING DATE: 1998-03-11 
; PRIOR APPLICATION NUMBER: 60/077649 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077791 
PRIOR FILING DATE: 1998-03-12 
PRIOR APPLICATION NUMBER: 60/078004 
PRIOR FILING DATE: 1998-03-13 
; PRIOR APPLICATION NUMBER: 60/078886 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 
PRIOR FILING DATE: 1998-03-20 
; PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078939 
PRIOR FILING DATE: 1998-03-20 
; PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
PRIOR APPLICATION NUMBER: 60/079656 
PRIOR FILING DATE: 1998-03-26 
PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
; PRIOR APPLICATION NUMBER: 60/079786 

PRIOR FILING DATE: 1998-03-27 
; PRIOR APPLICATION NUMBER: 60/079920 

PRIOR FILING DATE: 1998-03-30 
; PRIOR APPLICATION NUMBER: 60/079923 

PRIOR FILING DATE: 1998-03-30 
; PRIOR APPLICATION NUMBER: 60/080105 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080107 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080327 

PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080334 
; PRIOR FILING DATE: 1998-04-01 

PRIOR APPLICATION NUMBER: 60/08107 0 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081049 
PRIOR FILING DATE: 1998-04-08 
; PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 



PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
; PRIOR FILING DATE: 1998-04-09 
; PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/081819 
; PRIOR FILING DATE: 1998-04-15 

PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082569 
; PRIOR FILING DATE: 1998-04-21 

PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082804 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796 
PRIOR FILING DATE: 1998-04-23 
PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/0834 96 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083500 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 



PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084 640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084600 
PRIOR FILING DATE: 1998-5-07 
PRIOR APPLICATION NUMBER: 60/084627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085323 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085582 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085697 

Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

„ 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I i | | | M M I I I I I I I I t I M I I I I I I I I I I I i I I | 

DJD 63 ^ EPALASPAG AANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

Qy 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI FE 120 

noo I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I | | | I I I I I I II I I I I II 
Db 123 HS YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 

Qy 121 SENFFLPGIKWNGILGLAYATLAXPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

1 1 " 1 ' 1 I M I I I I I I I I I I I I I I I I | | | | I I I I | || | | | | | || || | | J | | | | | | 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 242 



Db 183 
Qy 181 



Db 243 



GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

1 " 1 > I I I I II I I I II II | | | | | | | | | | | I ! I I I I M I I ] I I M I I I If 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 3 02 



Ht7 

yy 


2 41 


DSGTTLLRLPQICVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 i 1 M 1 1 I 1 | | | | | | | | | | M f 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


JJD 


o o 

J (J J 


362 




301 


RDENS SRS FRITILPQLYIQPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVI FDRA 

1 1 1 1 1 1 1 I | I 1 | | | | I | | | | M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 * 1 1 1 

« ■ ■ • ■ i i i ■ i i i i i i i i i i ii i ii ii ii m 1 1 II 1 1 M 1 1 1 1 II 

RDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 1 ' 1 1 1 1 1 1 1 M I 1 1 I I 1 1 f 1 1 1 1 | i | 1 | | | | | I | | | | | | | | | | | | 
Q KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I L 4 68 




Db 


423 





RESULT 12 

US-09-97 8-192A-196 

Sequence 196, Application US/09978192A 
Patent No. US20020177553A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ashkenazi, Avi 
Baker Kevin P. 
Botstein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvaroff, Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber, Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godowski, Paul J. 
Grimaldi, J. Christopher 
Gurney, Austin L . 
Hillan, Kenneth J 
Kljavin, Ivar J. 
Kuo, Sophia S. 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas, Daniel 
Williams, P. Mickey 



Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C9 

CURRENT APPLICATION NUMBER: US/09/978, 192A 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 

2001-07-30 
NUMBER: 60/062250 

1997-10-17 
NUMBER: 60/064249 

1997-11-03 
NUMBER: 60/065311 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 



; PRIOR FILING DATE: 1997-11-13 
; PRIOR APPLICATION NUMBER: 60/066364 
; PRIOR FILING DATE: 1997-11-21 
; PRIOR APPLICATION NUMBER: 60/077450 
PRIOR FILING DATE: 1998-03-10 
PRIOR APPLICATION NUMBER: 60/077632 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077641 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077649 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077791 
PRIOR FILING DATE: 1998-03-12 
PRIOR APPLICATION NUMBER: 60/078004 
PRIOR FILING DATE: 1998-03-13 
PRIOR APPLICATION NUMBER: 60/078886 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 
; PRIOR FILING DATE: 1998-03-20 

PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078 939 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
; PRIOR APPLICATION NUMBER: 60/079656 
PRIOR FILING DATE: 1998-03-26 
PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
; PRIOR APPLICATION NUMBER: 60/079663 
; PRIOR FILING DATE: 1998-03-27 

PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
; PRIOR FILING DATE: 1998-03-27 

PRIOR APPLICATION NUMBER: 60/079920 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/080105 
; PRIOR FILING DATE: 1998-03-31 

PRIOR APPLICATION NUMBER: 60/080107 
; PRIOR FILING DATE: 1998-03-31 

PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080327 
; PRIOR FILING DATE: 1998-04-01 

PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080333 

PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080334 
PRIOR FILING DATE: 1998-04-01 



; PRIOR APPLICATION NUMBER: 60/081070 
; PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081049 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
; PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
; PRIOR FILING DATE: 1998-04-09 

PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
; PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/082568 
; PRIOR FILING DATE: 1998-04-21 

PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
; PRIOR APPLICATION NUMBER: 60/082804 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796 
; PRIOR FILING DATE: 1998-04-23 
; PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083554 

PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083500 



PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084 600 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085323 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085582 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085697 

Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 -^LEPALAS P^GAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 

Db « iJiililil 1 " IMIM 1111 1,1111 11 " " "MM II I MINI II II II I II I I 

Dt> 63 PALAS PAGAAN FLAMVDNLQGDS GRG YYLEMLI GT P PQKLQI LVDTGS SNFAVAGT P 122 

Q y 61 HSYIDTYFDTERSSTYRSKGFDVTVI<YTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI FE 120 

M II M II I | | | | | | | | M I M I M | | || || | | || | | || | | | | | | ,| , | , | | | 

Db 123 HSYIDTYFD TERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 182 



wy 


Izi 1 


SENFFLPGIKWNGILGIiAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | | | | 1 1 1 1 1 1 1 f ! M f 1 1 1 1 1 M 1 M 1 1 I 1 1 I 1 I 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 






242 




TOT 

1 o 1 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 M 1 1 1 1 1 1 1 f M I 1 1 ! | | | | | M | | | | f | | | | | | | t | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 






302 


\iy 


O /I 1 

4 1 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
" 1 1 < 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | f | | | | | | | | | | | || | , | || ,, | | ,,, M , 

DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YL 


300 




o n o 


362 


yy 


301 


RDENS SRS FRI TI LPQLYXQPMMGAGLNYEC YRFGI S PSTNALVI GATVMEGFYVI FDRA 
1 1 M 1 1 f 1 1 M 1 1 1 II 1 1 1 1 1 1 | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 t i 1 1 1 M 1 1 1 i i i i i i 
RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
' > 1 1 ! 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | 

QKRVGFAAS PCAE I AGAAVS E I S G P FS T EDVASNCVPAQ S L S E P I L 468 




Db 


423 





RESULT 13 

US-09-999-8 32A-196 

Sequence 196, Application US/09999832A 
Publication No. US 2 0020192706A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Ashkenazi, Avi 
Baker Kevin P. 
Botstein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvaroff, Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber, Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godowski, Paul J. 
Grimaldi, J. Christopher 
Gurney, Austin L. 
Hillan, Kenneth J 
Kljavin, Ivar J. 
Kuo, Sophia S. 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas, Daniel 
Williams, P. Mickey 
Wood, William I . 
INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 



TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C63 

CURRENT APPLICATION NUMBER: US/09/999, 832A 



CURRENT FILING DATE: 2001-10-24 
PRIOR APPLICATION NUMBER: 09/918585 
; PRIOR FILING DATE: 2001-07-30 

PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 
PRIOR FILING DATE: 1997-11-03 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
; PRIOR APPLICATION NUMBER: 60/066364 
PRIOR FILING DATE: 1997-11-21 
PRIOR APPLICATION NUMBER: 60/077450 
; PRIOR FILING DATE: 1998-03-10 

PRIOR APPLICATION NUMBER: 60/077632 
PRIOR FILING DATE: 1998-03-11 
; PRIOR APPLICATION NUMBER: 60/077641 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077649 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077791 
PRIOR FILING DATE: 1998-03-12 
PRIOR APPLICATION NUMBER: 60/078004 
PRIOR FILING DATE: 1998-03-13 
PRIOR APPLICATION NUMBER: 60/078886 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 
; PRIOR FILING DATE: 1998-03-20 

PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078939 
PRIOR FILING DATE: 1998-03-20 
; PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
PRIOR APPLICATION NUMBER: 60/079656 
PRIOR FILING DATE: 1998-03-26 
; PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
; PRIOR FILING DATE: 1998-03-27 

PRIOR APPLICATION NUMBER: 60/079920 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/080105 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080107 
; PRIOR FILING DATE: 1998-03-31 

PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 



PRIOR APPLICATION NUMBER: 60/080327 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
; PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080334 
; PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/081070 
; PRIOR FILING DATE: 1998-04-08 
; PRIOR APPLICATION NUMBER: 60/081049 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081071 
; PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081195 
; PRIOR FILING DATE: 1998-04-08 
; PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
; PRIOR FILING DATE: 1998-04-09 

PRIOR APPLICATION NUMBER: 60/081955 
; PRIOR FILING DATE: 1998-04-15 

PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
; PRIOR FILING DATE: 1998-04-15 
; PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
; PRIOR APPLICATION NUMBER: 60/082804 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082796 
PRIOR FILING DATE: 1998-04-23 
; PRIOR APPLICATION NUMBER: 60/083336 

PRIOR FILING DATE: 1998-04-27 
; PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083495 

PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083545 



PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083554 

PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083500 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
; PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084600 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084 627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085323 
; PRIOR FILING DATE: 1998-05-13 

PRIOR APPLICATION NUMBER: 60/085582 
; PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085697 



Query Match 100.0%; Score 2113; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 



yy 


I 


ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

' < 1 1 1 1 M 1 I f I I I I I I | I I | M I 1 1 I 1 I I I I I I | | I | j | | | | | M 1 1 ! 1 1 1 1 I I I M | | 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 




OO 


122 


yy 


Dl 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFE 

1 ' 1 1 f I 1 M 1 1 1 1 1 I I I | II 1 1 1 M 1 1 f 1 1 1 1 II II 1 1 1 1 f 1 1 1 I II I 1 1 1 1 I I I f 1 II f 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFE 


120 






182 


yy 


TOT 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
11 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | 1 I I I I || | | || | | | | | | | | | | | | | 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 




TOO 

loo 


242 


CiXr 

yy 


TOT 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 ' 1 " 1 1 1 1 1 1 1 1 I I I I | | | | 1 1 1 1 1 1 1 I I 1 1 II 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


jjjj 


z 4 o 


302 


yy 


z4 1 


DSGTTLLRLPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 
' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I I I | | I I | | | | M 1 1 1 1 1 1 1 1 1 I I | | | M 1 1 I I I I 

T~\ I"" 1 f* m III T "T*" T~\ T ^» _ __ _ __ III '''lift 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


I/O 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
1 1 1 1 1 1 1 1 1 | | | | | | | | | 1 | | | | 1 | | || | | 1 II II 1 1 II r j | 1 | | 1 1 ii 1 1 1 1 | , 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | M 1 1 1 1 1 1 1 | | | 1 1 
QKRVGFAAS P CAE I AGAAVS E I S GP FS T EDVAS N CVPAQS LS E P I L 4 68 




Db 


423 





RESULT 14 
US-09-97 8-18 9-19 6 

Sequence 196, Application US/09978189 
Publication No. US2 00300041 02A1 
GENERAL INFORMATION: 



APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 



Ashkenazi, Avi 

Baker Kevin P. 

Botstein, David 

Desnoyers, Luc 

Eaton, Dan 

Ferrara, Napoleon 

Filvaroff, Ellen 

Fong, Sherman 

Gao, Wei-Qiang 

Gerber, Hanspeter 

Gerritsen, Mary E. 

Goddard, Audrey 

Godowski, Paul J. 

Grimaldi, J. Christopher 

Gurney, Austin L. 

Hillan, Kenneth J 

Kljavin, Ivar J. 

Kuo, Sophia S. 

Napier, Mary A. 

Pan, James; 

Paoni, Nicholas F. 

Roy, Margaret Ann 

Shelton, David L . 



APPLICANT: Stewart, Timothy A. 
APPLICANT : Tumas, Daniel 
APPLICANT: Williams, P. Mickey 
APPLICANT: Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C7 

CURRENT APPLICATION NUMBER: US/ 09/ 97 8 , 1 8 9 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 

PRIOR FILING DATE: 1997-11-03 

PRIOR APPLICATION NUMBER: 60/065311 

PRIOR FILING DATE: 1997-11-13 

PRIOR APPLICATION NUMBER: 60/066364 

PRIOR FILING DATE: 1997-11-21 

PRIOR APPLICATION NUMBER: 60/077450 

PRIOR FILING DATE: 1998-03-10 

PRIOR APPLICATION NUMBER: 60/077632 

PRIOR FILING DATE: 1998-03-11 

PRIOR APPLICATION NUMBER: 60/077641 

PRIOR FILING DATE: 1998-03-11 

PRIOR APPLICATION NUMBER: 60/077649 

PRIOR FILING DATE: 1998-03-11 

PRIOR APPLICATION NUMBER: 60/077791 

PRIOR FILING DATE: 1998-03-12 

PRIOR APPLICATION NUMBER: 60/078004 

PRIOR FILING DATE: 1998-03-13 

PRIOR APPLICATION NUMBER: 60/078 886 

PRIOR FILING DATE: 1998-03-20 

PRIOR APPLICATION NUMBER: 60/078936 

PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078939 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
PRIOR APPLICATION NUMBER: 60/079656 
PRIOR FILING DATE: 1998-03-26 
PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079920 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 



PRIOR APPLICATION NUMBER: 60/080105 
; PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080107 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
; PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080327 
PRIOR FILING DATE: 1998-04-01 
; PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080334 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/081070 
; PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081049 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
; PRIOR APPLICATION NUMBER: 60/082569 

PRIOR FILING DATE: 1998-04-21 
; PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082804 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082796 
PRIOR FILING DATE: 1998-04-23 
PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 



; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
; PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
; PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083500 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
; PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/084640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084600 
PRIOR FILING DATE: 1998-5-07 
PRIOR APPLICATION NUMBER: 60/084627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085323 
PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085582 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 



PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085697 



Query Match 100.0%; Score 2113; DB 10; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 



0; 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 
' > M I I 1 1 1 ! 1 1 M 1 1 1 I 1 M I I I I | | | j | | | | | M f | | | | | | , | f , , , | , , , ,| , , , , , 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTP PQKLQI LVDTGS SNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
' 1 1 M i ! 1 1 M 1 1 1 1 1 1 I I I 1 1 1 1 | | 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | 1 1 I 1 1 1 1 I I I I I 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

' 1 1 M 1 1 1 1 f 1 1 1 I I I I | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | 1 1 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I I i i i i i i i i i i t i i i i 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | 1 1 1 1 1 | | | | | | 1 1 1 1 1 I I I 
DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENS S RS FRITILPQLYIQPMMGAGLN YECYRFGI S PSTNALVIGAT VMEGFYVT FDRA 

1 ' f 1 1 1 1 1 1 1 1 1 I 1 1 II 1 1 1 1 1 1 I I I I I | 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I 

RDENS SRS FRITI LPQL YIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEI SGPFSTEDVASNCVPAQSLSEPIL 4 06 

1 1 1 1 1 1 1 1 1 1 1 I 1 I I I 1 1 1 1 1 1 1 1 1 1 I I I f 1 1 1 1 | 

QKRVGFAAS P CAE I AGAAVS E I S GP FS T EDVASNCVPAQ S L S E P I L 4 68 




Db 


423 





RESULT 15 

US-09-97 8-608A-19 6 

Sequence 196, Application US/09978608A 
Publication No. US2 00300454 62A1 
GENERAL INFORMATION: 



APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 



Ashkenazi , Avi 
Baker Kevin P. 
Botstein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvaroff, Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber, Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godowski, Paul J. 
Grimaldi, J. Christopher 
Gurney, Austin L. 



APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 



Hillan, Kenneth J 
Kljavin, Ivar J. 
Kuo, Sophia S. 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas, Daniel 
Williams, P. Mickey 
Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C22 

CURRENT APPLICATION NUMBER: US/09/ 978 , 608A 
CURRENT FILING DATE: 2001-10-16 
NUMBER OF SEQ ID NOS : 624 

Prior Application removed - See File Wrapper or Palm 
SEQ ID NO 196 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapien 
US-09-97 8-608A-19 6 

Query Match 100.0%; Score 2113; DB 10; Length 518; 

Best Local Similarity 100.0%; Pred. No. 4.3e-203; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; Gaps 0; 

60 
122 
120 
182 
180 
242 



Qy 


1 


ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

' 1 1 1 1 ' ' N 1 1 1 I 1 I 1 I f 1 1 1 1 I I I | | | I | | | | f | | ( | t | f M 1 I I 1 1 1 1 I 1 I f | | M M 

ALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTP PQKLQI LVDTGS SNFAVAGTP 


Db 


63 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 1 1 1 '' ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! | | | | | ! 1 1 1 I 1 1 1 1 1 1 1 II 1 M I 1 1 1 1 1 ! I I M 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


Db 


123 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

1 1 1 1 1 M > 1 1 1 II 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 I I | M 1 1 1 1 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


Db 


183 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 M 1 1 ' 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I | | | | | | | | M 1 1 1 1 1 1 1 1 | 1 i 1 1 1 1 1 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


Db 


243 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

111 1 ! 1 1 1 1 I M 1 1 1 1 1 I I i | | | | | | M 1 ! 1 1 | | | | | | | | | | | | | | 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


Db 


303 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 

111 I 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 


Db 


363 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

' 1 1 1 1 1 I | | | | M 1 1 1 1 I 1 1 1 I I M | | | | | | | | | | | | | | | | 

Q KRVGFAAS P CAE I AGAAVS E I S G P FS T EDVASNC VPAQS L S E P I L 4 68 


Db 


423 



Search completed: March 4, 2004, 15:57:37 
Job time : 46.3511 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title : 

Perfect score 
Sequence : 

Scoring table 



Searched 



March 4, 2004, 15:28:35 ; Search time 58.3085 Seconds 

(without alignments) 
2196.942 Million cell updates/sec 

US-09-668-314C-2_COPY_63 468 
2113 

1 AL E P ALAS P AGAAN FLAMVD STEDVASNCVPAQSLSEPTL 406 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameter: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1017041 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : 



SPTREMBL 25:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp^invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp^rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp_unclassif ied: * 

sp_rvirus : * 

sp_bacteriap : * 

sp_archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No . 



% 

Query 

Score Match Length DB 



ID 



Description 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



2032 
1962 
1962 
1957 
1923 
1801 
1653 
1486.5 
1134 
1132 
1131 
1126 
1017 
997 
614.5 
461 
385 
355 
355 
351 
348 
346. 5 
344 
339. 5 
335. 5 
334.5 
326 
324 . 5 
324 . 5 
322.5 
322 
319. 5 
319 
318 
316 
316 
315 
313. 5 
312. 5 
309 
302 
302 
295 
295 
294 



96 
92 
92 
92 



2 
9 
9 
6 



91. 0 
85.2 



78 
70 
53 
53 
53 
53 
48 
47 
29, 



,2 
, 4 
, 7 
6 
5 
3 
1 
2 
1 



21.8 
18.2 
16.8 
16. 8 



16 
16 
16 
16 
16 
15, 



6 
5 
4 
3 
1 
9 



15. 8 
15.4 



15 
15 
15 
15 
15 
15 



4 
4 
3 
2 
1 
1 



15. 0 
15. 0 
15. 0 
14 . 9 
14.8 
14 . 8 



14 
14 
14 



6 
3 
3 



14.0 
14.0 
13.9 



439 4 Q9H2V8 

514 11 Q9JL18 

514 11 Q8C5E9 

514 11 Q8C793 

423 4 Q8N2D4 

468 4 Q9NZL2 

396 4 Q9NZL1 

500 13 Q7T0Y2 

501 11 Q8C7R1 
532 4 Q9ULS1 
501 11 Q8BQY4 
501 4 Q8IYC8 
255 11 Q9R1P7 
467 11 Q8C4F4 
267 11 Q9CUU5 
213 4 Q9P0D2 
24 4 5 Q8WQY9 
389 13 Q9W643 

389 13 Q9PWK1 
38 9 6 Q9GMY4 
38 9 6 Q9GMY3 

383 13 Q9DEC3 
38 9 6 Q9GMY5 

384 13 Q91322 
38 8 6 Q9GMY2 

391 5 Q9VKP6 
399 13 093458 
372 5 Q9VLK3 
383 13 Q9DE45 
397 13 Q9W6D4 

390 6 Q8SQ41 
387 13 Q9DDV5 
419 5 Q95VA2 

385 13 Q9DEC4 
378 13 Q9PUR9 

392 11 Q9D7R7 

397 13 Q80OA0 
396 13 093428 
354 5 Q9GYX7 

383 5 076856 

384 13 Q9DEC2 

398 13 P87370 
376 13 Q9PUR8 
387 6 Q9GMY8 
413 3 014413 



musculu 
sapien 
sapien 
sapien 



Q9h2v8 homo sapien 
Q9jll8 mus musculu 
Q8c5e9 mus musculu 
Q8c793 mus 
Q8n2d4 homo 
Q9nzl2 homo 
Q9nzll homo 
Q7t0y2 xenopus lae 
Q8c7rl mus musculu 
Q9ulsl homo sapien 

Q8bqy4 mus musculu 
Q8iyc8 homo sapien 
Q9rlp7 mus musculu 
Q8c4f4 mus musculu 
Q9cuu5 mus musculu 
Q9p0d2 homo sapien 
Q8wqy9 aphrocallis 
Q9w643 gallus gall 
Q9pwkl gallus gall 
Q9gmy4 sorex ungui 
Q9gmy3 rhinolophus 

Q9dec3 xenopus lae 
Q9gmy5 suncus muri 

Q91322 rana catesb 
Q9gmy2 oryctolagus 
Q9vkp6 drosophila 

093458 podarcis si 
Q9vlk3 drosophila 
Q9de45 salvelinus 
Q9w6d4 hynobius le 
Q8sq41 canis famil 
Q9ddv5 salvelinus 
Q95va2 clonorchis 
Q9dec4 rana catesb 
Q9pur9 pseudopleur 
Q9d7r7 mus musculu 
Q800a0 rana catesb 
093428 chionodraco 
Q9gyx7 boophilus m 
076856 dictyosteli 
Q9dec2 xenopus lae 
P8737 0 oncorhynchu 
Q9pur 8 pseudopleur 
Q9gmy8 sorex ungui 
014 4 13 pichia angu 



ALIGNMENTS 



RESULT 1 
Q9H2V8 

ID Q9H2V8 PRELIMINARY 
AC Q9H2V8; 

DT 01-MAR-2001 (TrEMBLrel. 
DT 01-MAR-2001 (TrEMBLrel. 
DT 01-OCT-2003 (TrEMBLrel. 



PRT; 439 AA. 
16, Created) 

16, Last sequence update) 
25, Last annotation update) 



DE CDA13 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pheochromocytoma; 

RA Li Y . , Huang Q. , Peng, y, Song H., Yu Y. , Xu S., Ren S., Chen Z 
RA Han Z . ; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 
CC -»- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 
DR EMBL; AF212252; AAG41783.1; -. 
DR HSSP; P00797; 2 REN . 

GO; GO: 0004194; F: pepsin A activity; IEA. 
GO; GO: 0008233; F:peptidase activity; IEA. 
GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 
InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1 . 
DR PRINTS; PR00792; PEPSIN. 
DR PROSITE; PS00141; ASP_PROTEASE; 2. 
KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 439 AA; 48275 MW; 02ECOE0E5OF1 1602 CRC64; 

Query Match 96.2%; Score 2032; DB 4; Length 439; 

Best Local Similarity 100.0%; Pred. No. 4.3e-161; 

Matches 389; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 18 MVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS YIDTYFDTERSSTYR 77 

>' I I I I I I | | | | | | | | M I I I I I f I I I ! f I | | M I I I I I I | | | I | | | | | | | ! | | | | | | f| 
Db 1 MVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYR 60 



DR 
DR 
DR 
DR 
DR 
DR 



Q y 78 SKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FESENFFLPGIKWNGILGL 137 

'I'll I I I I || I I | | | | | | | | | | | | || | m I I I I I I I I M I I I I I I I I I I I I 

SKGFDVTVKYTQGSWTGFVGEDLVT I PKGFNTSFLVNIAT I FESENFFLPGIKWNGILGL 12 0 



Db 61 



QY 138 AYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLY 197 

1 1 I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 AYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLY 18 0 

Q y 198 KGDIWYTPI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKAI VDSGTTLLRLPQKVFDAV 257 

I I • I I I I I I I I I I I I I I I I I I I I I I I I I I I | | || | | | | | | | ( | | | | | | | 

Db 181 KGDIWYTP I KEEWYYQI EI LKLEI GGQSLNLDCREYNADKAI VDSGTTLLRLPQKVFDAV 240 

QY 258 VEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQL 317 

>> I M I I I I I I I I | | I I I I I I I I I | I I I I I I I I I I I I I I M I I I I I I I I I II I I 

Db 241 VELAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YLRDENSSRSFRITILPQL 300 

Q y 318 YI QPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FDRAQKRVGFAAS PCAEI AGA 377 

1 M 1 I I I f I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I | | I I I I I II I I I I I I I I I I I I I 
Db 301 YI QPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FDRAQKRVGFAAS PCAEI AGA 360 

QY 378 AVSEISGPFSTEDVASNCVPAQSLSEPIL 406 

f I I I I M I I 1 I I I I I I I I I I I I | I I I I I I 
Db 361 AVSEISGPFSTEDVASNCVPAQSLSEPIL 389 



RESULT 2 
Q9JL18 

ID Q9JL18 PRELIMINARY; PRT; 514 AA. 

AC Q9JL18; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease 1. 

GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 

OX NCBI_TaxID=10090 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Choi D.K., Sugano S., Sakaki Y. ; 

RT "Molecular characterization of the mouse Aspl gene, a homolog of the 

RT human ASP1 (Down Syndrome Region aspartyl protease)."; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

CC SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AF216310; AAF36599.1; 

DR HSSP; P00797; 2 REN . 

DR MEROPS; A01.041; -. 

DR MGD; MGI:1860440; Bace2 . 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0008233; F:peptidase activity; IEA. 

GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 514 AA; 55799 MW; A7 0725F2C1DF5B4 7 CRC64; 

Query Match 92.9%; Score 1962; DB 11; Length 514; 

Best Local Similarity 91.6%; Pred. No. 3.8e-155; 

Matches 372; Conservative 15; Mismatches 15; Indels 4; Gaps 1; 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 60 
'I'' I : M I I I I M I I I I I I t I I I I I I I I I I I I I I I : I | | | | I | | | | | | | | | | 

ALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAGAP 118 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 12 0 
N I : i I II I I I I I I I M | | | | | | | | | | | | | | | | | | || | | | : | | | | | | | , j | | 

HSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATIFE 178 

SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 18 0 
M I I I I I I I I I I I I I I | | | | M I I I I I I I I I I | | | | || | | : : | | | | | | | | | | | || || 

SENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVAGS 238 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 
I I I M I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I | I I : I I I I I I I | | | | | I I I 



DR 
DR 
DR 



Qy 


1 


Db 


63 


Qy 


61 


Db 


119 


Qy 


121 


Db 


179 


Qy 


181 


Db 


239 



Qy 


241 


DSGTTLLRLPQICVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

NUN 1 1 I I I 1 I | | I I 1 1 I 1 1 1 1 1 I ! I 1 I : | | | | | | | | | 1 M : 1 1 1 1 1 M 1 1 

DSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISIYL 


300 


Db 


299 


358 


Qy 


301 


RDEN5-J 55RS FRTT T T.PnT.VTODMMCarT WVtrr , vr>T7r*T one mvr T t r-r *-> -n m, — m * „ 

i\L/LiiNoox\or t\± i iiir^jj i iy f iyuynj/\(jJLixsj ihjL, YKr GI SPSTNALVIGATVMEGFYVIFDRA 

M M : 1 1 1 1 1 1 I I I | | | | | | | | | | | | f I I I | | | | | | 1 1 1 I I 1 1 1 1 | | | | | I | | : | | | | 
RDENAS RS FRI T I L PQL YI QPMMGAGFN YEC YRFGI S S STNALVI GATVMEGFYWFDRA 


360 


Db 


359 


418 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II : 1 1 I I 1 I I I : 1 : 1 M 1 
QRRVGFAVSPCAEIEGTTVSEISGPFSTEDIASNCVPAQALNEPIL 4 64 




Db 


419 





DT 
DT 



RESULT 3 
Q8C5E9 

ID Q8C5E9 PRELIMINARY; PRT; 514 AA. 

AC Q8C5E9; 

DT 01-MAR-2003 ( TrEMBLrel . 23, Created) 

01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE Beta-site APP-cleaving enzyme 2. 
GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Testis ; 
RX MEDLINE=22354683; PubMed=124 668 51 ; 
RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse trans criptome based on functional annotation of 

RT 60, 770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK078770; BAC37384.1; -. 

DR MGD; MGI: 186044 0; Bace2 . 

GO; GO: 0004194; F:pepsin A activity; IEA. 
GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 
DR InterPro; IPR001969; Aspprotease_AS . 
DR InterPro; IPR001461; Peptidase_Al . 
DR InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1. ~~ 
DR PRINTS; PR00792; PEPSIN. 
DR PROSITE; PS00141; ASP_PROTEASE; 2. 

SQ SEQUENCE 514 AA; 55811 MW; CBB92 37BB68A0B2E CRC64; 

Query Match 92.9%; Score 1962; DB 11; Length 514; 

Best Local Similarity 91.6%; Pred. No. 3.8e-155; 

Matches 372; Conservative 15; Mismatches 15; Indels 4; Gaps 1; 

Qy 1 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GT PPQKLQI LVDTGS SNFAVAGT P 60 

c 1,11 1 : 1 I I M I I I I I I I I I I | | | | | I I I I I : I I I I I I I I I I I I I I I I 

Db 63 ALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAGAP 118 



DR 
DR 



Qy 



61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 120 



Db 


119 


1 1 11 11 ' ' ' : 1 1 f 1 1 1 1 I 1 I 1 1 1 f 1 1 If I I f I I I ! I | | | | | | | | j | : If | j | f | | | | | 
HSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATIFE 


178 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
M 1 1 I 1 1 1 1 I I I I I | | | | | | I | | | | | | | | | | | | | | | || | | :: | | | | | | | [| 1 1 M 1 1 

SENFFLPGIKWNGILGLAY7UUAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVAGS 


180 


Db 


179 


238 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

1 1 11 1 1 1 1 ' 1 1 1 1 1 N N 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 | : | | | | | | | 1 I | | 1 II 1 
GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIV 


240 


Db 


239 


298 


Qy 


241 


DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
1 1 1 1 1 I 1 1 1 ! 1 I 1 I | | | I I | | | | 1 1 1 1 1 1 1 If 1 1 1 1 : 1 1 1 1 1 I M 1 1 M : 1 1 1 1 I | | | | 

DSGTTLLRLPQKVFDAWEAVTSlRTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISIYL 


300 


Db 


299 


358 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
j j i i » i i i i i i i i i i i i i i ■ i i i , , , 

N 1 I : 1 1 1 I 1 1 1 I 1 ! I 1 1 1 1 M I | I | I I | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : | | | | 
RDENASRSFRITILPQLYIQPMMGAGFNYECYRFGISSSTNALVIGATVMEGFYWFDRA 


360 


Db 


359 


418 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 
1 = 11111 | | M | | f | | | | | | | || | | | : | | , | | | | , . , . , , | | 

QRRVGFAVSPCAEIEGTTVSEISGPFSTEDIASNCVPAQALNEPIL 4 64 




Db 


419 





RESULT 4 
Q8C793 

ID Q8C793 PRELIMINARY; PRT; 514 AA 

AC Q8C793; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP-cleaving enzyme 2. 

GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Heart ; 

RX MEDLINE=22354683; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 42 0:5 63-573(2002). 

DR EMBL; AK052309; BAC34931.1; -. 

DR MGD; MGI: 1860440; Bace2 . 

GO; GO:0004194; F:pepsin A activity; IEA. 
GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 
DR InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1. 
DR PRINTS; PR00792; PEPSIN. 
DR PROSITE; PS00141; ASP_PROTEASE; 2. 

SQ SEQUENCE 514 AA; 55871 MW; 8BF45E07B0990225 CRC64; 



DR 
DR 
DR 
DR 



Db 



Query Match 92.6%; Score 1957; DB 11; Length 514; 

Best Local Similarity 91.4%; Pred. No. le-154; 

Matches 371; Conservative 15; Mismatches 16; Indels 4; Gaps 1; 

Qy 1 ALE P ALAS P AGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDTGS SNFAVAGTP 60 

r 1,11 ' : ' I I i I I M I I I I I I I t I I I I I : M I I I I I I | | | | | | | | 

Db 63 ALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAGAP 118 

Q y 61 HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFE 120 

„ n 111111111:1 I I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | j | | : | | | | | | | ,, | | 

Db 119 HSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATIFE 178 

Qy 121 SENFFLPGI ^NGILGIAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 180 

M I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | || , , | | :: | | | | | | | j | | | | | j | 

179 SENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVAGS 238 

Qy 181 GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 24 0 

I I I I I I I I I I I I I I I I I I I | | | I I I I i I I I | I | | | | | ( | : | | | | | | | | | | | | , | | 

Db 239 GTNGGSLVXGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIV 298 

Q y 241 DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQ21iACWTNSETPWSYFPKISI YL 300 

1 1 1 " I I I I I f I I 1 I I I I I I I I I I I I I I I I I I I I f I : M I I I | | | ! | | | : | M I I I I I | 
Db 299 DSGTTLLRLPQKVFDAWEAVARTSLI PEFSDGFWTGAQLACWTNSETPWAYFPKI SI YL 358 

Q y 301 RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 360 

oc 1 1 1 I : I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | I I I I I I I I | | | | | | | : | | | | 
Db 359 RDENAS RS FRTTI LPQL YI QPMMGAGFN YEC YRFGI S S S TNALVI GATVMEGFYWFDRA 418 

361 QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 406 
I : I N I I I I I I I I I M I I I I M I I I I : I I I I I | | | : | : | j | | 
Db 419 QRRVGFAVSPCAEIEGTTVSEI SGPFSTEDIASNCVPAQALNEPIL 464 



RESULT 5 
Q8N2D4 

ID Q8N2D4 PRELIMINARY; PRT; 423 AA. 

AC Q8N2D4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein OVARC10003 63 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Ovarian carcinoma; 

RA Ota T . , Nishikawa T., Suzuki Y. , Kawai-Hio Y. , Hayashi K. , Ishii S. 

RA Saxto K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y., 

RA Nagahari K. , Sugano S., Isogai T.; 

RT "HRI human cDNA sequencing project."; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/DDBJ databases 

DR EMBL; AK075539; BAC11682.1; -. 

DR GO; GO:0004194; F:pepsin A activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 



DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE ; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 423 AA; 46457 MW; 4D4 839F2ED9C2CE1 CRC64; 

Query Match 91.0%; Score 1923; DB 4; Length 423; 

Best Local Similarity 99.2%; Pred. No. 5.2e-152; 

Matches 370; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Q ^ 34 MLIGTp PQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWT 93 

1 1 > M I I I I i I I I I | | | | M I I I I I I I I | | j | | | | | | | M I I II I I | | | | | | | M | | M I 
Db 1 MLIGT PPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWT 60 

Q y 94 GFVGEDLVTIPKGFNTSFLvNIATIFESENFFLPGI?CWNGILGIA.YATLAKPSSSLETFF 153 

I I I I I I I I I I I I I I I | | | | | | | | | | | | | I I II I II : I I I I I | | | | | | | | | | I I I I I I I I 
GFVGEDLVTIPKGFNTSFLVNIATIFESGNFFLPGIQWNGILGLAYATLAKPSSSLETFF 12 0 



Db 61 



Q y 154 DS LVTQANI PNVFSMQMCGAGLPVAGSGTNGGS LVLGGI EPSLYKGDI WYTP I KEEWYYQ 213 

1 1 ' I I I I I I I II M I I I M I I M I I I I I M I I I I I I I I | | | | | | I I I I I I I I I I I I I I I 
Db 121 DS LVTQANI PNVFSMQMRGAGLPVAGS GTNGGS LVLGGI EP S LYKGDIWYT P I KEEWYYQ 180 



Q y 214 IEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVELAVARASLI PEFSDG 273 

1 > I I I I I I M I I I I I I | | | | | | | | | | | | | M I I I I I I I I I I I I | f | | | | | | | | | | I I I I | 
IEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDG 240 



Db 181 



Qy 274 FWTGSQLACWTNSETPWSYFPKISI YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYR 333 

1 1 1 ' I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | I I I I I I I I I I | | | | 

Db 241 FWTGSQLACWTNSETPWSYFPKISI YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYR 300 

Q ^ 334 FG I S P S TN ALVI GAT VME G F YVI FDRAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS 393 

'< I II I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I II f I I I I I I I I | | | | I I I I I I I 
Db 301 FGI S PSTNALVI GATVMEGFYVI FDRAQ KRVG FAAS PCAEIAGAAVS EI SGPFSTEDVAS 360 

Qy 394 NCVPAQSLSEPIL 406 

I I I I I I I I I I I I I 
Db 361 NCVPAQSLSEPIL 373 



RESULT 6 
Q9NZL2 

ID Q9NZL2 PRELIMINARY; PRT; 4 68 AA. 

AC Q9NZL2; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease. 

GN BACE2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 



RT 
RT 
RL 



RX MEDLINE=2 04224 77; PubMed=10965 1 18 ; 

RA Solans A. , Estivill X., de La Luna S.; 

"A new aspartyl protease on 21q22.3, BACE2, is highly similar to 
Alzheimer's amyloid precursor protein beta-secretase. "; 
Cytogenet. Cell Genet. 8 9:177-184(2 000). 
CC SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AF188276; AAF35835.1; 
DR HSSP; P00797; 2 REN . 

GO; GO: 0004194; F:pepsin A activity; IEA. 
GO; GO: 0008233; F:peptidase activity; IEA. 
GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 
InterPro; IPR001969; Aspprotease_A3 . 
InterPro; IPR001461; Peptidase_Al . 
InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1. 
DR PRINTS; PR00792; PEPSIN. 
DR PROSITE; PS00141; ASP_PROTEASE; 2. 
KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 468 AA; 50324 MW; 717E0920126A0142 CRC64; 

Query Match 85.2%; Score 1801; DB 4; Length 468; 

Best Local Similarity 87.7%; Pred. No. 9.3e-142; 

Matches 356; Conservative 0; Mismatches 0; Indels 50; Gaps 1; 



DR 
DR 
DR 
DR 
DR 
DR 



Qy 


1 


ALE P ALAS PAGAAN FLAMVDNLQGDS GRGY YLEMLI GT P PQKLQI LVDT GS SNFAVAGT P 

" M 1 1 1 1 1 1 M 1 f | | | M 1 II 1 1 1 1 1 I 1 1 1 1 1 I I | | | I | M | | | M | | | | | | || 

ALEPALAS PAGAAN FLAMVTtNLQGDSGRGYYLEML I GTP PQKLQI LVDT GSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 

N M M 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 I I 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 
SENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 M 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 | | ! | | | | | | | M | | | | | | | | | | | | 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
1 1 1 M 1 1 1 1 1 | | M || | || | | | | | | | 


300 


Db 


303 


328 


Qy 


301 


RDENS S RS FRIT I LPQLYIQPMMGAGLNYECYRFGI SPSTNALVI GATVMEGFYVI FDRA 

1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 I I I I | | [ I | | 
LYIQPMMGAGLNYECYRFGISPSTNAL VI GATVMEGFYVI FDRA 


360 


Db 


329 


372 


Qy 


361 


QKRVG FAAS PCAE I AGAAVS EISGPFSTE DVASN CVP AQ S L S E P I L 4 06 

I'll 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M | | | | | | | | | 

QKRVG FAAS PCAEIAGAAVSEI SGP FSTEDVASNCVPAQSLS EPI L 418 




Db 


373 





RESULT 7 
Q9NZL1 

ID Q9NZL1 PRELIMINARY; PRT; 396 AA. 

AC Q9NZL1; 



01-OCT-2000 (TrEMBLrel. 15, Created) 

Ol-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

Aspartyl protease. 

BACE2 . 

Homo sapiens (Human) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
NCBI_TaxID=9 60 6; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=2 04 2 24 7 7; PubMed=l 0965118; 
Solans A., Estivill X., de La Luna S.; 

"A new aspartyl protease on 21q22.3, BACE2 , is highly similar to 

Alzheimer's amyloid precursor protein beta-secretase. "; 

Cytogenet. Cell Genet. 8 9:177-184(2 000). 

-!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

EMBL; AF188277; AAF35836.1; -. 

HSSP; P00797; 2 REN . 

GO; GO: 0004194; Frpepsin A activity; IEA. 

GO; GO: 0008233; F:peptidase activity; IEA. 

GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR001461; Peptidase_Al . 

InterPro; IPR009007; Pept_A_acid. 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Aspartyl protease; Hydrolase; Protease. 

SEQUENCE 396 AA; 43013 MW; 5023A7AF3 91CEAC9 CRC64; 



Query Match 78.2%; Score 1653; DB 4; Length 396; 

Best Local Similarity 100.0%; Pred. No. 1.7e-129; 

Matches 316; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


i 


ALEPAIiAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M | | I | | | | | I | | | | | { | | | | | 

ALEPALAS PAGAANFLAMVDNLQGDS GRG YYLEMLI GTP PQKLQI LVDTGS SNFAVAGT P 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
I ' 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 I 1 1 1 M 1 1 1 1 1 1 | ! | I | f | | | M 1 1 1 M 1 1 1 1 f I 
HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGI KWNGI LGLAYATLAKP S S SLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 

>< 1 1 1 1 1 I M 1 1 1 1 1 1 1 I 1 M 1 ! 1 I 1 I 1 1 I I | | | | | | | | | | | | | | | | | | | | | | | | 

SENFFLPGI KWNGI LGLAYATLAXPSS SLETFFDSLVTQANI PNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 
1 1 1 1 1 M 1 II 1 I I 1 I I 1 I I I I I | I | | | | | | | | | | | | | | | | | | | | | f | | , | || ,, | | ,;, , 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 
1 1 < 1 1 1 II 1 1 1 1 1 1 1 1 1 I | | | | M 1 1 1 1 1 1 1 1 1 1 I 1 1 1 II M 1 1 1 1 1 I 1 1 | f | | | | | | | | 
DSGTTLLRLPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQ 316 





1 1 II II 1 1 1 1 1 1 1 1 1 1 



DT 

DT 

DT 

DE 

GN 

OS 

OC 

OC 

OX 

RN 

RP 

RX 

RA 

RT 

RT 

RL 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

SQ 



Db 



363 RDENSSRSFRITILPQ 378 



RESULT 8 
Q7T0Y2 

ID Q7T0Y2 PRELIMINARY; PRT; 500 AA. 

AC Q7T0Y2; 

DT 01-OCT-2003 (TrEMBLrel . 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=22341132; PubMed=124 54 9 17 ; 

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S W 

RA Richardson P.; 

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus 

RT initiative 

RL Dev. Dyn. 225:384-391(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE-22388257; PubMed=124 77 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy s[ J . , 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P H 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , Butterfield Y S 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.e!,"' 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2 002) 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RA Klein S., Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC055989; AAH55989.1; 

KW Hypothetical protein. 

SQ SEQUENCE 500 AA; 54722 MW; 1 0F1 6756CAFDCD0B CRC64; 



Query Match 70.4%; Score 1486.5; DB 13; Length 500; 

Best Local Similarity 67.1%; Pred. No. 1.9e-115; 

Matches 273; Conservative 61; Mismatches 70; Indels 3; Gaps 1; 

Q y 3 EPAL AS PAGAAN FLAMVDNLQGDS GRGY YLEML I GT P PQKLQI LVDT GS SNFAVAGT 59 

: 1 1 : 1 1 I I : I I I I I I | | M | II I I : I I I : I I I I : | M I I I I II I I | | | ; 
Db 46 KPG LLLASDPGGTINFFSMVDNLAGDSGRGYYLELLIGSPPQKVNILVDTGSSNFAVAGS 105 

Q y 60 PHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIF 119 

1 : = * I : I I : : I : : I : | : I I I : I M I I I I I : I : I : I : | | | | | : | | : | | | : | | 
Db 106 P^PDVNTFFDSKLSTS YQSLNTEVTVRYTQGSWTGLLGKDWS I PKGVNGTFLINIASI F 165 



Db 



Q y 120 ESENFFLPGIJ<WNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAG 179 

: ' > : I I I I I I I I I I I I I : I I I I I I I I : I I I I , I I I I I I : I I | | | | || I I I 
166 QSESFFLPNINWQGILGLAYSTLAKPSSSVEPFFDSLVQQENIPDVFSMQMCGAGQSSPG 225 

Q y 180 SGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAI 239 

: ' ' I I I t I I I : I I I I I I I : I I I I | | I I I I I | : | : | | I : ! I | | , [ I | I I : | || I 

Db 226 NGINAGSLVLGGVEPSLYKGNIWYTPITEEWYYQVEVLKFEVGGQRLNLDCTVYNSDKAI 2 85 

Q y 240 VDSGTTLLRLPQKVFDAVVE1AVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI Y 299 

I I I I I I I I I I I I I I : I : I : | : : | | | | : | | | I | | i , - - | | : | | | , | , , 

Db 286 VDSGTTLLRLPDKVFNAMVDAIVQTSLIQNFNAEFWAGLQLACWDKTQQPWNYFPDISIY 345 

Q y 300 L^DENS S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FDR 359 

I I I I : I I I I I : I : I I I I I I : : : I : M I M I : | | | I I I I I I I I I | | | | | 

Db 346 LRD ™TSRSFRLTLKPQLYIQSVLTFQESLNCFRFGISQSASTLVI GATVMEGFYVI FDR 405 

Qy 360 AQKRVG FAAS P CAE I AGAAVS E I S GP FS T E DVASNCVPAQ S L S E P I L 4 06 

I : I I I I I I I I I I : : I Il||:||| | | | : | f| : I I I | : 

Db 406 AEKRVGFAVSSCAEVSGITVSEIAGPFGTSDVSSNCIARNPLREPIM 452 



DT 
DT 
DE 



RESULT 9 
Q8C7R1 

ID Q8C7R1 PRELIMINARY; PRT; 501 AA. 

AC Q8C7R1; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
Beta-site APP cleaving enzyme. 
GN BACE . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
OX NCBI_TaxI D=l 0 090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Spinal cord; 
RX MEDLINE=22354 683; PubMed=12 4 6685 1 ; 
RA The FANTOM Consortium, 

the RIKEN Genome Exploration Research Group Phase I & II Team; 
"Analysis of the mouse transcriptome based on functional annotation of 
RT 60,770 full-length cDNAs . " ; 
RL Nature 420:563-573(2002). 
DR EMBL; AK049626; BAC33844.1; 



RA 
RT 



DR MGD; MGI : 134 6542; Bace. 

GO; GO: 0004194; F:pepsin A activity; IEA. 
GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 
InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1. 
DR PRINTS; PR00792; PEPSIN . 
DR PROSITE; PS00141; ASP_PROTEASE; 1. 

SEQUENCE 501 AA; 55761 MW; B4 10DA8B64 64 7 663 CRC64; 



DR 
DR 
DR 
DR 
DR 



SQ 



Query Match 53.7%; Score 1134; DB 11; Length 501; 

Best Local Similarity 54.0%; Pred. No. 5.2e-86; 

Matches 209; Conservative 64; Mismatches 110; Indels 4; Gaps 2- 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLI GTP PQKLQI LVDTGS SNFAVAGTPHS YI DT YF 
1 : 1 : N 1 1 1 : 1 I | : | | | : | | | | : | | | | | | | | | | | | | | | | , , . . , . 

PGRRGS FVEMVDNLRGKSGQGYYVEMTI GS PPQTLNI LVDTGS SNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESFNFFT Pf 


1 O Q 
L/LO 




1 1 A 
-L X ft 


: 1 1 1 1 1 1 1 1 1 1 1 1 1 : I | | | : | | | | : 111,1:11: 

QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


173 


Qy 


129 


I KWNGI LGLAYATLAKP S S S LET FFD S LVTQANI PNVFSMQMCGAGLPV AGSGTNGG 

' ' 1 1 1 1 1 1 1 : I : I 1 1 I 1 1 f II 1 I : | | | : | | : | : | | | | | : : | , 

SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
' : : : 1 ' 1 = III 1 :|||ll: IMM: | : : : | | | | | : | | : | || 1 1 : 1 1 1 I I I I 

SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 
1111 = 111:11::: II .'Mill 1 II II 1 II : II 1 1 : || | : 
NLRLPKKVFEAAVKSIKAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SLYLMGEVT 


305 


Db 


294 


353 


Qy 


306 


SRS FRITI LPQL YIQPMMGAGLNY- EC YRFGI S PSTNALVI GATVMEGFYVI FDRAQKRV 
> 1 M 1 1 1 1 M 1 : : 1 : : : I | : | : | | : 1 : 1 1 : I I I 1 1 1 : 1 1 1 I : I | : 
NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 


364 


Db 


354 


413 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 




Db 


414 


Ml 1 1 : : Ml 1 I: 

G FAVS ACH VH D E FRT AAVE G P FVT ADM 44 0 





RESULT 10 
Q9ULS1 

ID Q9ULS1 PRELIMINARY; PRT; 532 AA. 

AC Q9ULS1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-OCT-2001 (TrEMBLrel. 18, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein KIAA1149 (Fragment) . 

GN KIAA114 9. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI TaxID=9606; 



RN 


r 1 1 




RP 


POT TIT "NT f - " "FT" IT'DfYM' M A 




DP 


iibou ij— 15 r a in ; 




DV 


ntujLXiNtj— z uujybio ; PubMed=l 057 4 461; 




KM. 


Hirosawa M. , Nagase T . , Ishikawa K., Kikuno R. , Nomura N . , 


Ohara 0 . ; 


Kl 


"Characterization of cDNA clones selected by the GeneMark 


analysis 


RT 


rrom size rractionated cDNA libraries from human brain. " ■ 




TTW 7X "Doc; C . QOQ Q^C M QQQ\ 

jjrj-M. kc s. d . ozy- Joo ( lyyy ] . 




CC 


oh'iijjakii I . DbiiUiNLrb 1U FhjFIlDASE FAMILY Al . 






imyidLi/ AnUJzy /o; BAAtfo4o3.2; — . 




fir 


noof, rooZ/Z; IAMo . 






GO, GO.UUU4194; F:pepsin A activity; IEA. 




JJK 


GO; GO: 0008233; F:peptidase activity; IEA. 




UK 


GO, GO: 0006508; P : proteolysis and peptidolysis ; IEA. 




TlR 


interfro, IPR001969; Aspprotease AS. 




DR 


InterPro; IPR001461; Peptidase Al. 




UK 


InterPro; IPR009007; Pept A acid. 




DR 


Pfam; PF00026; asp; 1. 




DR 


PRINTS; PR00792; PEPSIN. 




DR 


PROSITE; PS00141; ASP PROTEASE; 1. 




KW 


Hypothetical protein; Aspartyl protease; Hydrolase; Protease. 


FT 


NON TER 1 1 




SQ 


SEQUENCE 532 AA; 58720 MW; 98B135D0D5FBD2E8 CRC64 ; 





Query Match 53.6%; Score 1132; DB 4; Length 532; 

Best Local Similarity 54.0%; Pred. No. 8.3e-86; 

Matches 209; Conservative 65; Mismatches 109; Indels 4; Gaps 2 

9 PAGAANFLAWDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 
I : ' : M M I : I f I : I I I : I I : | : | | | | I I I || f ! I I I I I M : : I : 

85 PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 144 

69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 
: ' I I I I I I I I | | | | : | | I | : | | | | : j | | , 

145 QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 204 

129 IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS GTNGG 185 

I I I I I I I I I : I : I III I I I I I I I : : | | : | | : | : M I I I : I : II 

2 05 SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGG 2 64 

186 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 24 5 

I : : : I I I : III I : I II I I : I I I I : : I : : : I | | | | : | | : | | | | | : | | M I I I 
2 65 SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 324 

246 LLRLPQKVFDAVAAEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 305 

I I I I : I I I = I I : : : II : I I I I I I I I I I I I I : I I I I : I I I : 

325 NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVT 384 

306 SRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRV 364 

: : I 1 I I I I I I I I : : I : : : I I : I II I : | : | | : | | | | | | : | | | | : I I : 

385 NQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRI 444 

365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 

IN i I : : I I I I I : 

4 45 GFAVSACHVHDEFRTAAVEGPFVTLDM 471 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



RESULT 11 




Q8BQY4 




ID 

*x_ 


s^od^-l^ if t\ cj Li ± iyi ± jn x ; JrKJ. ; 501 AA. 




AC 


Q8BQY4 ; 




DT 


01-MAR-2003 (TrEMBLrel. 23, Created) 






01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 




i-J X 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 




DE 


Beta-site APP cleaving enzyme. 




GN 


BACE. 




OS 


Mus mus cuius (Mouse) . 




oc 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 


ox 


NCBI TaxID=10090; 




RN 


[1] 




Dp 
IX XT 


SEQUENCE FROM N.A. 




Rf 


° 1 rvT-j-iN — o / £5±j/ do , iibbUtj-Drain; 






MEDLINE=22354 683; PubMed=124 6685 1 ; 




RA 


The FANTOM Consortium, 




RA 


the RIKEN Genome Exploration Research Group Phase I & II 


Team; 


RT 


"Analysis of the mouse trans criptome based on functional 


annotation 


RT 

X % _L 


60,770 full-length cDNAs . " ; 


RL 


Nature 42 0:563-573(2 002). 






EMBL; AK046175; BAC32 620.1; -. 






MGD; MGI: 1346542; Bace. 






GO; GO:0004194; F:pepsin A activity; IEA. 




DR 


GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 




DR 


InterPro; IPR001969; Aspprotease AS. 




DR 


InterPro; IPR001461; Peptidase Al . 




DR 


InterPro; IPR009007; Pept A acid. 




DR 


Pfam; PF00026; asp; 1. 




DR 


PRINTS; PR00792; PEPSIN. 




DR 


PROSITE; PS00141; ASP PROTEASE; 1. 




SQ 


SEQUENCE 501 AA; 55816 MW; C0855513145E024E CRC64; 





of 



Query Match 53.5%; Score 1131; DB 11; Length 501; 

Best Local Similarity 53.7%; Pred. No. 9.2e-86; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 : 1 : 1 M 1 1 : 1 1 1 : 1 1 1 : I I : 1 : I | | | 1 1 1 1 I I || | | | | | | : : | s 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


68 


Db 


54 


113 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 
: ' 1 1 1 1 1 1 MM 1 1 U llhll 1 1 : III | ||: ||: | 
QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


128 


Db 


114 


173 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 

1 1 1 1 1 1 1 M : | : , 1 1 1 1 1 1 1 1 1 1 : 1 1 | : | | : | : | | I | | : : | | 

SNWEGILGLAY7VEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE7VLASVGG 


185 


Db 


174 


233 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
1 : : : 1 N : IN 1 : 1 1 I I | : | | | | : : | : : : | | | | | : | | : | | | | | : { j | | | | | 

SMI IGGI DHS LYTGRLWYTPI RREWYYEVI I VRVEINGQDLKMDCKEYN YDKS I VDSGTT 


245 


Db 


234 


293 


Qy 


246 


LLRLPQPCVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 
1 I M : 1 1 1 : 1 1 : : : If : 1 1 1 1 1 1 1 1 1 1 I 1 1 : 1 1 1 1 : 1 1 1 : 
NLRLPK?CVFELAAVT<SIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVT 


305 


Db 


294 


353 



3 °6 S RS FRIT I LPQLYIQPMMGAGLN Y- ECYRFGI S PSTNALVI GATVMEGFYVI FDRAQKRV 364 
-MINIMI I : : I : : : I I : I : I I : | : | | : I I I | | | : | t I I : I I : 

Db 354 NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 413 



Qy 365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 

I I I ! I : : III I I: 

Db 414 G FAVS ACH VH D E FRTAAVE G P FVTADM 44 0 



RESULT 12 
Q8IYC8 

ID Q8IYC8 PRELIMINARY; PRT; 501 AA. 

AC Q8IYC8; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP-cleaving enzyme. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC TISSUE=Brain; 
RA Strausberg R. ; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/DDBJ databases 
DR EMBL; BC036084; AAH36084.1; 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

GO; GO: 0006508; P : proteolysis and peptidolysis; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 
InterPro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1 . 
DR PRINTS; PR00792; PEPSIN. 
DR PROSITE; PS00141; ASP_PROTEASE; 1. 

SQ SEQUENCE 501 AA; 55824 MW; 768595CF5517EFB7 CRC64; 

Query Match 53.3%; Score 1126; DB 4; Length 501; 

Best Local Similarity 53.7%; Pred. No. 2.4e-85; 

Matches 208; Conservative 65; Mismatches 110; Indels 4; Gaps 2; 

PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 
I s I s I M M = I I I : I I I : I I : I : | | | | | | | | | | | | | | | | | | : : | : 



DR 
DR 
DR 
DR 



Qy 


9 


Db 


54 


Qy 


69 


Db 


114 


Qy 


129 


Db 


174 


Qy 


186 


Db 


234 



HI I I I I I I I I : I I I I : I I I I : I I I I II : I I 



I M I I I I I I : I : I M I I I I I I I I : : I I : M : I : | | | | | : | 



I : : : I I I : Ml I = 11111: MM:: I : : : | | | | | : | | : | | | M : I I M M I 



Qy 


246 


Db 


294 


Qy 


306 


Db 


354 


Qy 


365 


Db 


414 



1111 = 111:1 I : : : II : I I I I | | | | | | I I I : I I M : I I I 



MINIMI I : : I : : : f | : | | | f : | : | | : | | | | | | : || | | : | | : 



III II : : I I I I I : 



RA 
RT 



RESULT 13 
Q9R1P7 

ID Q9R1P7 PRELIMINARY; PRT; 255 AA. 

AC Q9R1P7; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease (Fragment) . 

GN BACE2 . 

OS Mus musculus (Mouse). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N. A. 

Accarino M. , Fumagalli P., Taramelli R. , Ottolenghi S.; 

"Cloning of a gene from chromosome 21 Down Region encoding a potential 
RT transmembrane protease. "; 

RL Submitted (FEB-1998) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF051150; AAD45964.1; -. 

DR MEROPS; A01.041; -. 

DR MGD; MGI : 1860440; Bace2. 

GO; GO: 0004190; F : aspartic-type endopeptidase activity; IEA. 
GO; GO: 0008233; F:peptidase activity; IEA. 
DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 
DR InterPro; IPR001969; Aspprotease_AS . 
DR InterPro; IPR009007; Pept_A__acid . 
DR PROSITE; PS00141; ASP_PROTEASE; 1. 
KW Protease. 

FT NON_TER 1 1 

SQ SEQUENCE 255 AA; 28685 MW; 53DE3 17 8 15 996D63 CRC64 ; 

Query Match 48.1%; Score 1017; DB 11; Length 255; 

Best Local Similarity 92.2%; Pred. No. l.le-76; 

Matches 189; Conservative 9; Mismatches 7; Indels 0; Gaps 0; 

Qy 2 °2 WYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAV 2 61 

I I I I I I M I I I I I M II I I I I I I : f I I I I i I I I I I I I I I I I I I I I I | | t I I I I i I I M I I 
Db 1 WYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAV 60 

Qy 26 2 ARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDENSSRSFRITILPQLYIQP 321 

H Mill M : I | | | | | | | | | | | : | | | | | | | | | | | | | : | | | I M ! | , I I 

Db 61 ARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI YLRDENASRSFRITILPQLYIQP 120 



DR 
DR 



Qy 322 MMGAGLNYECYRFGISPSTNALVIGATVTVTEGFWIFDRAQKRVGFAASPCAEIAGAAVSE 381 

I I I I I I I I I I I II I I I I I I I i I I I | | I | | | | | : | | | || : j | | | | | || | | | | j | | 

Db 121 HMGAGFNYECYRFGISSSTNALVIGATVMEGFYVVFDRAQRRVGFAVSPCAEIEGTTVSE 18 0 

QY 382 ISGPFSTEDVASNCVPAQSLSEPIL 406 

I I I M I I I I : I I I I I I I I : I : I f I I 
181 ISGPFSTEDIASNCVPAQALNEPIL 205 



RESULT 14 
Q8C4F4 

ID Q8C4F4 PRELIMINARY; PRT; 467 AA. 

AC Q8C4F4; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP cleaving enzyme. 

GN BACE . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Cerebellum; 

RX MEDLINE-22354683; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK082317; BAC38462.1; -. 

DR MGD; MGI: 1346542; Bace. 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE ; 1. 

SQ SEQUENCE 467 AA; 52063 MW; 31AB674FF1843652 CRC64; 

Query Match 47.2%; Score 997; DB 11; Length 467; 

Best Local Similarity 48.1%; Pred. No. 1.3e-74; 

Matches 186; Conservative 59; Mismatches 104; Indels 38; Gaps 3; 

QY 9 PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 

I s I s M M I : I I I : I I I : I I : I : I I I | | | | | | | | | | | | | | | : : | : 

Db 54 PGRRGSFVT1MVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 113 

QY 69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 

: I I I I I I I I I M I I : I I I I : I I I I : I I I I M : I I : I 

Db H4 QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 173 



DR 
DR 
DR 



Qy 



129 IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 185 



I I I I I I I I f : I : I III I i I I I I I : I I I : I I : I : | | | | | : : | I 

Db 17 4 SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHI PNIFSLQLCGAGFPLNQTEALASVGG 233 

1Q 6 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 245 
I : : : I I I : III I : I I I I I : I I I I : : f : : : j | | f | : | | : | 

Db 234 SMIIGGIDHSLYTGSLWYTPIRREWYYEVI IVRVEINGQDLKMDCKE 280 

246 LLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 305 

:| Mil I II M Ml: M Ihll I : 

UJD 281 TEKFPDGFWLGEQLVCWQAGTTPWNI FPVI S LYLMGEVT 319 

Qy 306 SRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVI FDRAQKRV 364 

"MINIMI I : : I : : : I I : | : | | : | : | | : | | | I I I : I I I I : | | : 

Db 320 NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYVVFDRARKRI 379 

Qy 365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 

Mill : : III I I: 

Db 38 0 G FAVS AC HVHDE FRTAAVE G P FVT ADM 4 06 



RESULT 15 
Q9CUU5 

ID Q9CUU5 PRELIMINARY; PRT; 2 67 AA. 

AC Q9CUU5; 

DT 01-JUN-2001 ( TrEMBLrel . 17, Created) 
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Adult male brain cDNA, RIKEN full-length enriched library, 
DE clone: 3526402A15 product : beta-site APP cleaving enzyme, full insert 
DE sequence (Fragment) . 

GN BACE . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain ; 

RA Adachi J., Aizawa K. , Akahira S., Akimura T., Aral A., Aono H. , 

RA Arakawa T., Bono H., Carninci P., Fukuda S., Fukunishi Y. , Furuno 

RA Hanagaki T., Hara A., Hayatsu N., Hiramoto K . , Hiraoka T., Hori F 

RA Imotani K., Ishii Y. , Itoh M. , Izawa M. , Kasukawa T., Kato H., 

RA Kawai J., Kojima Y., Konno H., Kouda M. , Koya S., Kurihara C, 

RA Matsuyama T., Miyazaki A., Nishi K., Nomura K. , Numazaki R./ohno 

RA Okazaki Y. , Okido T., Owa C, Saito H., Saito R. , Sakai C, Sakai K 

RA Sano H., Sasaki D., Shibata K. , Shibata Y., Shinagawa A., Shiraki T 

RA Sogabe Y., Suzuki H., Tagami M. , Tagawa A., Takahashi F . , Tanaka T 

RA Tejima Y., Toya T . , Yamamura T., Yasunishi A., Yoshida K. , Yoshino 

RA Muramatsu M. , Hayashizaki Y. ; 

RL Submitted (JUL-2000) to the EMBL/ GenBank/DDBJ databases 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain ; 

RX MEDLINE=22354683; PubMed=12 4 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

"Analysis of the mouse transcriptome based on functional annotation of 



M. 



M 



r 

M 



RT 



RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 
RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TI SSUE=Brain ; 

RX MEDLINE=21085660; PubMed=l 12 17 851 ; 

RA RIKEN FAN TOM Consortium; 

RT "Functional annotation of a full-length mouse cDNA collection "; 

RL Nature 4 09:685-690(2 001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TlSSUE=Brain; 

RX MEDLINE=99279253; PubMed=1034 9 636 ; 

RA Carninci P., Hayashizaki Y. ; 

RT "High-efficiency full-length cDNA cloning."; 

RL Meth. Enzymol. 303:19-44(1999). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC STRAIN— C57BL/ 6J; TISSUE=Brain ; 

RX MEDLINE=2 04 99374; PubMed=11042 159 ; 

RA Carninci P., Shibata Y. , Hayatsu N. , Sugahara Y. , Shibata K. , Itoh M. , 

RA Konno Okazaki Y., Muramatsu M. , Hayashizaki Y. ; 

RT "Normalization and subtraction of cap-trapper-selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes 

RL Genome Res. 10:1617-1630(2000). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=20530913; PubMed=11076861 ; 

Shibata K. , Itoh M. , Aizawa K. , Nagaoka S., Sasaki N., Carninci P., 
Konno H., Akiyama J., Nishi K. , Kitsunai T., Tashiro H., Itoh M. , 

RA Sumi N., Ishii Y. , Nakamura S., Hazama M. r Nishine T . , Harada A.' 

RA Yamamoto R. , Matsumoto H. , Sakaguchi S., Ikegami T., Kashiwagi k! , 

RA Fujiwake S., Inoue K. , Togawa Y., Izawa M. , Ohara E., Watahiki M. , 

RA Yoneda Y. , Ishikawa T . , Ozawa K., Tanaka T., Matsuura S., Kawai J., 

RA Okazaki Y. , Muramatsu M. , Inoue Y., Kira A., Hayashizaki Y.; 

RT "RIKEN integrated sequence analysis (RISA) system-384-f ormat 

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

DR EMBL; AK014390; BAB29317.2; -. 

DR MGD; MGI: 1346542; Bace. 

GO; GO:0004194; Frpepsin A activity; IEA. 
GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 
InterPro; IPR001461; Peptidase_Al . 
InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

FT NON__TER 1 1 

SQ SEQUENCE 267 AA; 30333 MW; 94 13EB4530AB63B0 CRC64 ; 

Query Match 29.1%; Score 614.5; DB 11; Length 267; 

Best Local Similarity 52.4%; Pred. No. 4.9e-43; 

Matches 108; Conservative 41; Mismatches 56; Indels 1; Gaps 1; 

Qy 187 LVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTL 246 

: : : N I : IN I : I I II I : MM:: | : : : | | | | | : | | : | | | | | : | | | | | | | 
Db 1 MI I GGI DHSLYTGSLWYT PI RREWYYEVI I VRVEINGQDLKMDCKEYNYDKS I VDSGTTN 60 



RA 
RA 



DR 
DR 
DR 
DR 



247 LRLPQKVFDAVVEAVARAS LI PEFS DGFWTGSQLACWTNSETPWS YFPKI S I YLRDENS S 306 
I I M : I I I : I I : : : II : I I ! I I I I f I f I I I : I I ! I : | | | : : 

Db 61 LRL PKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTN 120 

QY 307 RS FRITI LPQLYI QPMMGAGLNY- ECYRFGI S PSTNALVI GATVMEGFYVI FDRAQKRVG 365 

s I I I I I I I I I I : : I : : : I I : I : ! I : I : I I : I || I I I : I | | | : | | : | 

Db 121 QSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYVVFDRARKRIG 180 

QY 366 FAASPCAEIAGAAVSEISGPFSTEDV 391 

I'll : : | | | | | : 

181 FAVSACHVHDEFRTAAVEGP FVTADM 206 



Search completed: March 4, 2004, 15:38:53 
Job time : 59.3085 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd 



OM protein - protein search, using sw model 
Run on : 



Title : 

Perfect score: 
Sequence : 



March 4, 2004, 15:22:30 ; Search time 12.9574 Seconds 

(without alignments) 

1631.532 Million cell updates/sec 

US-09-668-314C-2_COPY_63 468 
2113 

1 ALEPALAS PAGAANFLAMVD STEDVASNCVPAQSLSEPIL 406 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database : 



SwissProt 42 : * 



Pred. No. xs the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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P07339 
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mus musculu 
homo sapien 
oryctolagus 
homo sapien 
gallus gall 
mus musculu 
macaca fuse 
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oryctolagus 
macaca fuse 
homo sapien 
pan troglod 
oryctolagus 
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oryctolagus 
oryctolagus 
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gallus gall 
cavia porce 
homo sapien 
macaca mula 
callithrix 
equus cabal 
sus scrofa 
bos taurus 
oryctolagus 
ovis aries 



ALIGNMENTS 



RESULT 1 
BAE2_HUMAN 

ID BAE2_HUMAN STANDARD ; PRT; 518 AA. 

AC Q9Y5Z0; Q9UJT6; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta secretase 2 precursor (EC 3.4.23.45) (Beta-site APP-cleaving 

enzyme 2) (Aspartyl protease 1) (Asp 1) (ASP1) (Membrane-associated 
aspartxc protease 1) (Memapsin-1) (Down region aspartic protease). 



DE 
DE 

GN BACE2 OR ASP21. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=20057170; PubMed=105 912 13 ; 

RA Yan R. , Bienkowski M. J . , Shuck M.E., Miao H., Tory M.C., Pauley A.M 

RA Brashier J.R., Stratman N.C., Mathews W.R., Buhl A.E., Carter D.B., 

RA Tomasselli A.G., Parodi L.A., Heinrikson R.L., Gurney M.E.; 

RT "Membrane-anchored aspartyl protease with Alzheimer's disease 

RT beta-secretase activity."; 

RL Nature 4 02:533-537(1999). 



RN [2] 

RP SEQUENCE FROM N.A. 
RC TISSUE=Bone marrow; 

RA Xin H., Stephans J.C., Duan X., Harrowe G., Kim E . , Grieshammer U. , 
RA Giese K. ; 

RT "Identification of a novel aspartic-like protease differentially 

RT expressed in human breast cancer cell lines."; 

RL Submitted (JAN-1999) to the EMBL/ GenBank/DDB J databases 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Accarino M.P., Fumagalli P., Ottolenghi S., Taramelli R. ; 

RT "Cloning of a gene from chromosome 21 Down region encoding a potential 

RT transmembrane aspartyl protease. "; 

RL Submitted (FEB-1998) to the EMBL/ GenBank/ DDB J databases 
RN [4] 

RP SEQUENCE FROM N.A. 

RA Solans A., Estivill X., de la Luna S.; 

RT "Cloning of a novel mammalian aspartyl protease."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20120043; PubMed=1065 62 50 ; 

RA Hussain I., Powell D.J., Howlett D.R., Tew D.G., Meek T.D. 

RA Chapman C, Gloger I.S., Murphy K.E., Southan CD., Ryan d!m 

RA Smith T.S., Simmons D.L., Walsh F.S., Dingwall C, Christie G. ; 

RT "Identification of a novel aspartic proteinase (Asp 2) as 

RT beta-secretase . " ; 

RL Mol. Cell. Neurosci. 14:419-427(1999). 
RN [6] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20144060; PubMed=106774 83 ; 

RA Lin X., Koelsch G. , Wu S., Downs D . , Dashti A., Tang J. ; 

RT "Human aspartic protease memapsin 2 cleaves the beta-secretase site of 

RT beta-amyloid precursor protein."; 

RL Proc. Natl. Acad. Sci . U.S.A. 97:1456-1460(2000) 

RN [7] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20289799; PubMed=10830953 ; 

RA Hattori M. , Fujiyama A., Taylor T.D., Watanabe H., Yada T . , 

RA Park H.-S., Toyoda A., Ishii K. , Totoki Y. , Choi D.-K., Groner Y 

RA Soeda E . , Ohki M. , Takagi T., Sakaki Y., Taudien S., Blechschmidt ' K w 

RA Polley A. , Menzel U., Delabar J., Kumpf K ., Lehmann R. , Patterson D w 

RA Reichwald K., Rump A., Schillhabel M. , Schudy A., Zimmermann W 

RA Rosenthal A., Kudoh J., Shibuya K. , Kawasaki Asakawa S. 

RA Shmtam A w Sasaki T., Nagamine K. , Mitsuyama s., Antonarakis S.E 

RA Minoshima S., Shimizu Nordsiek G. , Hornischer K. r Brandt P 

RA Scharfe M. , Schoen O., Desario A., Reichelt J. , Kauer G. , Bloecker H 

RA Ramser J., Beck A., Klages S., Hennig S., Riesselmann L. f Dagand E. , ' 

RA Wehrmeyer S., Borzym K. , Gardiner K. , Nizetic D., Francis F 

RA Lehrach H., Reinhardt R. r Yaspo M.-L.; ' 

RT "The DNA sequence of human chromosome 21. M ; 

RL Nature 405:311-319(2 000). 

RN [8] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Skin; 

RX MEDLINE=22388257; PubMed=124 7 7 932 ; 

Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 



RA 



RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

RN 

RP 

RX 

RA 

RA 

RT 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 



Klausner R.D., Collins F.S., Wagner L., Shenmen CM-, Schuler G 
Altschul S.F., Zeeberg B . , Buetow K.H., Schaefer C.F., Bhat N.K 



D 



I., Wang J., Hsieh F. , 
Rubin G.M., Hong L . , 



Casavant T 
, Carninci 
Ab rams on R 



T.E 



L. , Scheetz 
P . , Prange C . , 
D. , Mullahy S.J 
Gunaratne P.H. , 



Hopkins R.F., Jordan H., Moore T., Max S 
Diatchenko L., Marusina K., Farmer A. A. , 
Stapleton M. , Soares M.B., Bonaldo M.F., 
Brownstein M.J., Usdin T.B., Toshiyuki S 
Raha S.S., Loquellano N . A. , Peters G.J., 
Bosak S.A., McEwan P.J., McKernan K. J. , ^Malek J. A. 

W^f* W °^ ley K *°" ^ S " GarC±a A - M " Ga ^ L.J.THuTyk'siw., 

Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R A 

J \', Helt ° n E " Ketteman M - Madan A., Rodrigues S., Sanchez A., 
Whiting M., Madan A., Young A. C , Shevchenko Y. , Bouffard G G 
Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C, 
Rodriguez A.C., Grimwood J., Schmutz J., Myers R M 
Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D E 
Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 
"Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences. "; 

Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002) 
[9] 

CHARACTERIZATION. 

MEDLINE=22088158; PubMed=12093293 ; 

Turner R.T. Ill, Loy J. A. , Nguyen C, Devasamudram T., Ghosh A K 
Koelsch G., Tang J. ; 

"Specificity of memapsin 1 and its implications on the design of 
memapsm 2 (beta-secretase ) inhibitor selectivity."; 
Biochemistry 41:8742-8746(2002). 

-!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 
Val-Asn-Leu-|-Asp-Ala-Glu-Phe in the Swedish variant of 
Alzheimer's amyloid precursor protein. 

-!- SUBCELLULAR LOCATION: Type I membrane protein. 

-!- SIMILARITY: Belongs to peptidase family Al . 

Hit SWI5 f- PR f . entr * is copyright. It is produced"through _ 7 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
^ non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AF200342; 
EMBL; AF1178 92; 
EMBL; AF050171; 
EMBL; AF178532; 
EMBL; AF204944; 
EMBL; AF200192; 
EMBL; AL1632 84; 
EMBL; AL1632 85; 



AAF17078 . 1 
AAD45240 . 1 
AAD45963. 1 
AAF29494.1 
AAF26368.1 
AAF13714 . 1 
CAB90458. 1 
CAB90554 . 1 
AAH14453. 1 



EMBL; BC014453; 
HSSP; P00797; 2 REN . 
MEROPS; A01.041; -. 
Genew; HGNC : 9 3 4 ; BACE2 . 
MIM; 605668; -. 

GO; GO: 0005624; C:membrane fraction; TAS . 

GO; GO: 0004190; F : aspartic- type endopeptidase activity; TAS 
GO; GO:0006464; P:protein modification; TAS. 



DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



GO; GO: 0009306; Prprotein secretion; TAS . 
InterPro; IPR001969; Aspprotease_AS. 
InterPro; IPR009007; Pept_A_acid. 
InterPro; IPR001461; Peptidase_Al . 
Pfam; PF00026; asp; 1. 
PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Glycoprotein; Zymogen; Transmembrane; 
Signal. 



SIGNAL 

PROPEP 

CHAIN 

DOMAIN 

TRANSMEM 

DOMAIN 

ACT^SITE 

ACT_SITE 

CARBOHYD 

CARBOHYD 

CONFLICT 

SEQUENCE 



1 

21 
7 

21 
474 
495 
110 
303 
170 
366 
36 
518 AA, 



20 
7 

518 
473 
494 
518 
110 
303 
170 
366 
36 



POTENTIAL . 

POTENTIAL. 

BETA SECRETASE 2. 

EXTRACELLULAR (POTENTIAL) 

POTENTIAL. 

CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
BY SIMILARITY. 
N-LINKED (GLCNAC. . .) 
N-LINKED (GLCNAC. . .) 
A -> T (IN REF. 6) . 



(POTENTIAL) . 
(POTENTIAL) . 



56180 MW; 2E903150 8237 60D3 CRC64; 



Query Match 100.0%; Score 2113; DB 1; Length 518; 

Best Local Similarity 100.0%; Pred. No. 1.6e-153; 

Matches 406; Conservative 0; Mismatches 0;' Indels 0; 



Gaps 



0; 



Qy 


1 


ALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTPPQKLQI LVDTGSSNFAVAGT P 

1 1 1 ' 1 1 I f 1 1 1 I 1 1 1 1 M 1 1 1 1 1 1 I I I I I | | | | | | | | | | | | | i | | | | t | , | ,,,,, M , , 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 


60 


Db 


63 


122 


Qy 


61 


HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 
1 M 1 " 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 f 1 1 1 1 1 I 1 f 1 1 1 || MUM 

HSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFE 


120 


Db 


123 


182 


Qy 


121 


SENFFLPGIKWNGILGLAYATIAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 
'< < 1 1 1 1 1 1 II 1 1 I I I | | | | | | | | | | | | | | | | | | | | I I 1 1 I M 1 1 1 1 1 1 

SENFFLPGIPCWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS 


180 


Db 


183 


242 


Qy 


181 


GTNGGSLVLGGIEPSL YKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 

111 ' I 1 1 1 1 1 1 1 1 1 I | M | | 1 1 1 M 1 ! 1 1 1 1 1 1 1 II 1 II II M M 1 1 1 1 f 1 1 1 1 1 

GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIV 


240 


Db 


243 


302 


Qy 


241 


DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 

1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

DSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYL 


300 


Db 


303 


362 


Qy 


301 


RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRA 
111 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 

RDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FDRA 


360 


Db 


363 


422 


Qy 


361 


QKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPIL 4 06 

t ' 1 M 1 1 M | | | 1 II 1 1 1 1 I I 1 1 1 1 1 1 | | | 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 
QKRVGFAAS P CAE I AGAAVS E I S G P FS T EDVASNC VPAQ S LS E P I L 468 




Db 


423 





RESULT 2 
BACE_RAT 

ID BACE_RAT STANDARD; PRT; 501 AA. 



AC P56819; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP cleaving 

enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 

DE (Aspartyl protease 2) (Asp 2) (ASP2) (Membrane-associated aspartic 

DE protease 2) (Memapsin-2) . 

GN BACE . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae • Rattus 

OX NCBI_TaxID=10116; murinae, Rattus. 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20002972; PubMed=10531052 ; 

Vassar R. , Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E A 
Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R. , Luo'y 
Fisher S., Fuller J., Edenson S., Lile J., Jarosinski M.A 
RA Biere A.L., Curran E . , Burgess T., Louis J.-C, Collins F 
RA Treanor J., Rogers G., Citron M. ; ' 

"Beta-secretase cleavage of Alzheimer's amyloid precursor protein bv 
the transmembrane aspartic protease BACE."; 
RL Science 2 8 6:735-741(1999). 

CC -!- FUNCTION: Responsible for the proteolytic processing of the 

amyloid precursor protein (APP) . cleaves at the amino terminus of 
the A-beta peptide sequence, between residues 671 and 672 of APP 
leads to the generation and extracellular release of beta-cleaved 
soluble APP, and a corresponding cell-associated carboxy-terminal 
fragment which is later release by gamma-secretase (By 
CC similarity) . 

CC CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 

Val-Asn-Leu-| -Asp-Ala-Glu-Phe in the Swedish variant of 
cc Alzheimer's amyloid precursor protein. 

CC SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 



RA 
RA 
RA 



RT 
RT 



CC 
CC 
CC 
CC 
CC 



CC 



CC This SWISS-PROT^ entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

rr US h-^ Y w n °^ ofit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

or send an email to license@isb-sib . ch) 

CC 

DR EMBL; AF190727; AAF04144.1; -. 

DR HSSP; P32329; 1YPS . 

DR MEROPS; A01.004; 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. ~ 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP PROTEASE; 1. 



KW Hydrolase; Aspartyl protease; Glycoprotein; Zymogen; Transmembrane; 
i\W bignal . 

FT SIGNAL 1 21 POTENTIAL. 



FT PROPEP 

FT CHAIN 

FT DOMAIN 

FT TRANSMEM 

FT DOMAIN 

FT ACT_SITE 

FT ACT_SITE 

FT DISULFID 

FT DISULFID 

FT DISULFID 

FT CARBOHYD 

FT CARBOHYD 

FT CARBOHYD 

FT CARBOHYD 

SQ SEQUENCE 



22 

46 

22 
458 
479 

93 
289 
216 
278 
330 
153 
172 
223 
354 
501 AA, 



45 
501 
457 
478 
501 

93 
289 
420 
443 
380 
153 
172 
223 
354 

55806 



POTENTIAL. 
BETA-SECRETASE. 
EXTRACELLULAR (POTENTIAL) 
POTENTIAL. 

CYTOPLASMIC ( POTENTIAL ) . 
BY SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
N-LINKED (GLCNAC. . 
N-LINKED ( GLCNAC . . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 



BY 
BY 
BY 
BY 



) 



MW; 24B445BC8BE87DE3 CRC64; 



(POTENTIAL) 
( POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



Query Match 53.7%; Score 1134; DB 1 ; Length 501; 

Best Local Similarity 53.7%; Pred. No. 6e-79; 

Matches 208; Conservative 65; Mismatches 110; Indels 



4; Gaps 



2; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



9 PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 
I : I : I I I I I : I I I : I I I : | | : | : I I I | | | | | | | | | | | | | | | :: | . 

54 PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 113 

69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 
: 1 1 1 1 1 I I M II I I : | | | | : M | | : | | | | , | . j , . , 

114 QRQLSSTYRDLRKSVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 173 

129 IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV AGSGTNGG 185 

' I I I M I M : I : I I I I I I I I I I I : | | | : | | : | : | | | | | : . , , 

174 SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTEALASVGG 233 

18 6 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 245 

1 : : : ' 1 1 : 'I' I =11111: MM:: | : : : || II I : I I : || I II : M II I II 
234 SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 293 

24 6 LLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 305 

11 1 1 : 1 I I : I I : : : M : I Mil i M II || | : (I | | : | | | : 

2 94 NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SLYLMGEVT 353 

306 S RS FRI T I LPQL YIQPMMGAGLN Y- ECYRFGI S PSTNALVI GATVMEGFYVI FDRAQKRV 364 

: : 1 1 1 1 1 1 1 11 1 : : I : : : II : I : I I : | : | I M I I I II : M M : I I : 

354 NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 413 

365 GFAASPCAEIAGAAVSEI SGPFSTEDV 391 

III I I : : III I I: 

414 GFAVSACHVHDEFRTAAVEGPFVTADM 440 



RESULT 3 
BACEJVIOUSE 

ID BACE_MOUSE STANDARD; PRT; 501 AA 

AC P56818; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP clea 



DE 

DE 

DE 

GN 

OS 

OC 

OC 

OX 

RN 

RP 

RX 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

RN 

RP 

RA 

RL 

RN 

RP 

RX 

RA 

RA 

RA 

RT 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 



enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 
(Aspartyl protease 2) (Asp 2) (ASP2) (Membrane-associated aspartic 
protease 2) (Memapsin-2) . 
BACE . 

Mus musculus (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
NCBI_TaxID=10090 ; 
[1] 

SEQUENCE FROM N . A. 

MEDLI NE=2 0 002972; PubMed= 10531052; 

Vassar R. , Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E A 
Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R. , Luo'y 
Fisher S., Fuller J., Edenson S., Lile J., Jarosinski M.A. , 
Biere A.L., Curran E . , Burgess T . , Louis J.-C, Collins F 
Treanor J. , Rogers G., Citron M. ; 

"Beta-secretase cleavage of Alzheimer's amyloid precursor protein by 
the transmembrane aspartic protease BACE."; 
Science 2 8 6:735-741(1999). 
[2] 

REVISIONS TO 6 AND 81-87. 

Bennett B.D., Vassar R. , Citron M. ; 

Submitted (JAN-2000) to the EMBL/ GenBank/DDBJ databases. 
[ 3 J 

SEQUENCE FROM N.A. 

MEDLINE=2 005717 0; PubMed=10591213 ; 

Yan R., Bienkowski M. J. , Shuck M.E., Miao H., Tory M.C., Pauley A M 
Brashier J.R., Stratman N.C., Mathews W.R., Buhl A. E . , Carter D B 
Tomasselli A.G., Parodi L.A., Heinrikson R.L., Gurney M.E ; 
"Membrane-anchored aspartyl protease with Alzheimer's disease 
beta-secretase activity. 
Nature 4 02:533-537(19 9 9). 

-!- FUNCTION: Responsible for the proteolytic processing of the 

amyloid precursor protein (APP) . Cleaves at the amino terminus of 
the A beta peptide sequence, between residues 671 and 672 of APP 
leads to the generation and extracellular release of beta-cleaved 
soluble APP, and a corresponding cell-associated carboxy-terminal 
fragment which is later release by gamma-secretase (By 
similarity) . 

-!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 
Val-Asn-Leu-|-Asp-Ala-Glu-Phe in the Swedish variant of 
Alzheimer's amyloid precursor protein. 

- SUBCELLULAR LOCATION: Type I membrane protein 

- TISSUE SPECIFICITY: Brain. 

- SIMILARITY: Belongs to peptidase family Al . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on 
use by non-profit institutions as long as its content 
modified and this statement is not removed. 



its 



is m no way 

.... . Usage by and for commercial 

entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



or send an email to license@isb-sib . ch ) 



EMBL; AF190726; AAF04143.2; -. 
EMBL; AF200346; AAF17082.1; 
HSSP; P56272; 1AM5 . 



DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 



MEROPS; A01.004; 

MGD; MGI: 1346542; Bace. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Pepticiase_Al . 

Pfam; PF00026; asp; 1 . 

PRINTS; PR007 92; PEPSIN. 

PROSITE; PS00141; ASP PROTEASE; 1. 



KW 


Sicrnal 






FT 


SIGNAL 


i 

J. 


O 1 


FT 


PROPEP 




H D 


FT 


CHAIN 


46 


501 


FT 


DOMAIN 


22 


457 


FT 


TRANSMEM 


458 


478 


FT 


DOMAIN 


479 


501 


FT 


ACT SITE 


93 


93 


FT 


ACT_SITE 


289 


289 


FT 


DISULFID 


216 


420 


FT 


DISULFID 


278 


443 


FT 


DISULFID 


330 


380 


FT 


CARBOHYD 


153 


153 


FT 


CARBOHYD 


172 


172 


FT 


CARBOHYD 


223 


223 


FT 


CARBOHYD 


354 


354 


SQ 


SEQUENCE 


501 AA; 


55747 



POTENTIAL. 

POTENTIAL. 

BETA-SECRETASE. 

EXTRACELLULAR ( POTENTIAL ) 

POTENTIAL. 

CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



MW; C085A013145E474E CRC64; 

Query Match 53.6%; Score 1133; DB 1; Length 501; 

Best Local Similarity 53.7%; Pred. No. 7.1e-79; 

Matches 208; Conservative 65; Mismatches 110; Indels 



4; Gaps 



2; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



9 PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 68 

' =1 * I M I I : I I I :l I I: I I : I: I I I M M I I I I || :: |: 

54 PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 113 

69 DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 128 
s I I I I I I I I I II I I : I I | | : | | | | : M I I I I : | | : | 

114 QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 173 

129 I KWNGI LGLAYATLAKP S S S LET FFDS LVTQANI PNVFSMQMCGAGLPV AGSGTNGG 185 

11 ' : I : I I M I : I I I : I I : I : I I | | | : : | | 

174 SNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNI FSLQLCGAGFPLNQTE1ALASVGG 233 

186 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 245 

' 1 : 1 1 1 1 : 111 1 : M I I I : MM:: | : : : | I | | | : M : I II I I : I I I I II I 
234 SMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTT 293 

246 LLRLPQKVFDAWEAVTU^ASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENS 305 

nnA I I I I M I I : I I : : : I I : \ Ml I I : I I I I : I I I : 

294 NLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVT 353 

306 S RS FRI T I L PQL YI QPMMGAGLN Y- EC YRFGI S P STNALVI GATVMEGF YVI FDRAQKRV 364 

: 2 I I M M I M I : : I : : : I | : | ; | | : | . | | M II I I I : || | | : | | : 

354 NQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRI 413 

365 GFAASPCAEIAGAAVSEISGPFSTEDV 391 
I I I I I : : || | | J : 



Db 414 G FAVS ACHVH D E FRT AAVE G P FVT ADM 440 



DT 
DT 
DE 
DE 
DE 
DE 



RESULT 4 
BAC E_HUMAN 

ID BAC E_HUMAN STANDARD; PRT; 501 AA. 

AC P56817; Q9BYB9; Q9BYC0 ; Q9BYC1; Q9UJT5 * 
DT 30-MAY-2000 (Rel. 39, Created) 

30-MAY-2000 (Rel. 39, Last sequence update) 
10-OCT-2003 (Rel. 42, Last annotation update) 
DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP cleaving 
enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 
(Aspartyl protease 2) (Asp 2) (ASP2 ) (Membrane-associated aspartic 
protease 2) (Memapsin-2 ) . 
GN BACE OR BACE1 . 
OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhinx; Hominidae; Homo 
OX NCBI_TaxID=9 606; 
RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 
RC TISSUE=Brain; 

RX MEDLINE=20002972; PubMed=10531052 ; 

RA Vassar R. , Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E.A. , 
RA Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R., Luo Y. , 
RA Fisher S., Fuller J., Edenson S., Lile J . , Jarosinski M.A. , 
RA Biere A.L., Curran E., Burgess T., Louis J.-C, Collins F 
RA Treanor J., Rogers G. , Citron M. ; 

"Beta-secretase cleavage of Alzheimer's amyloid precursor protein by 
tne transmembrane aspartic protease BACE."; 
Science 2 8 6:735-741(1999). 
RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM A), SEQUENCE OF 46-68 AND 
RP CHARACTERIZATION. 
RC TISSUE=Brain; 

RX MEDLINE=20057171; PubMed-105912 14 ; 

RA Sinha S., Anderson J. P., Barbour R., Basi G.S., Caccavello R., 

RA avis D Doan M. , Dovey H.F., Frigon N., Hong J., Jacobson-Croak K., 

RA Jewett N., Kexm P., Knops J. , Lieberburg I., Power M. , Tan H 

S l^l Un °r, G ''I Ung J " SChenk D " Seubert p - Suomensaari S.M./wang S. 

RA Walker D., Zhao J., McConlogue L., Varghese J.; 

pt " Pur± f icati ° n ^nd cloning of amyloid precursor protein beta-secretase 

Ki rrom human brain."; 

RL Nature 4 02:537-54 0(1999). 
RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RX MEDLINE=20057170; PubMed=l 05912 13 ; 

RA Yan R. , Bienkowski M.J., Shuck M.E., Miao H., Tory M.C., Pauley A.M., 

RA Brashier J.R., Stratman N.C., Mathews W.R., Buhl A.E., Carter D.B., 

RA Tomasselli A.G., Parodi L . A. , Heinrikson R.L., Gurney M E • 

RT "Membrane-anchored aspartyl protease with Alzheimer's disease beta- 

RT secretase activity. "; 

RL Nature 4 02:533-537(199 9). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RX MEDLINE=20120043; PubMed-1 065 6250 ; 

RA Hussain I., Powell D.J., Howlett D.R., Tew D.G., Meek T.D. 



RT 
RT 
RL 



RT 
RT 
RT 



RA Chapman C, Gloger I.S., Murphy K.E., Southan CD., Ryan D M 
RA Smith T.S., Simmons D.L., Walsh F.S., Dingwall C, Christie g' ; 
RT "Identification of a novel aspartic proteinase (Asp 2) as beta- 
RT secretase."; 

RL Mol. Cell. Neurosci. 14:419-42 7(19 99) 
RN [5] 

RP SEQUENCE FROM N . A. (ISOFORM B) . 
RC TISSUE^Brain, and Pancreas; 

RA Michel B., De Pietri Tonelli D., Zacchetti D., Keller P.; 
RT "New beta-site APP cleaving enzyme isoform (BACE-1B) obtained from 
RT human brain and pancreas."; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDB J databases. 
RN [ 6] 

RP SEQUENCE FROM N.A. (ISOFORM C) . 
RC TISSUE=Pancreas ; 

RA Zacchetti D., De Pietri Tonelli D., Schnurbus R. ; 

RT "New beta-site APP cleaving enzyme isoform (BACE-1C) obtained from 
RT human pancreas."; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDB J databases. 
RN [ 7 ] 

RP SEQUENCE FROM N.A. (ISOFORMS B; C AND D) 
RC TISSUE^Brain; 

RX MEDLINE=21408467; PubMed=l 15 16562 ; 
RA Tanahashi H., Tabira T.; 

"Three novel alternatively spliced isoforms of the human beta-site 
amyloid precursor protein cleaving enzyme (BACE) and their effect on 
amyloid beta-peptide production."; 
RL Neurosci. Lett. 307:9-12(2001). 
RN [8] 

RP SEQUENCE OF 14-501 FROM N.A. (ISOFORM A), AND CHARACTERIZATION 
RX MEDLINE=20144060; PubMed=106774 83 ; 

RA Lin X., Koelsch G., Wu S., Downs D., Dashti A., Tang J. ; 

"Human aspartic protease memapsin 2 cleaves the beta-secretase site of 
beta-amyloid precursor protein."; 

Proc. Natl. Acad. Sci. U.S.A. 97:1456-14 60(2000) 
RN [9] 

RP DISULFIDE BONDS. 

RX MEDLINE=21950860; PubMed=119534 58 ; 

RA Fischer F., Molinari M. , Bodendorf U. , Paganetti P.; 

RT "The disulphide bonds in the catalytic domain of BACE are critical but 
RT not essential for amyloid precursor protein processing activity 
RL J. Neurochem. 80:1079-108 8(2002). ' 
CC -!- FUNCTION: Responsible for the proteolytic processing of the 

a ^ yloid P^cursor protein (APP) . Cleaves at the amino terminus of 
the A-beta peptide sequence, between residues 671 and 672 of APP 
leads to the generation and extracellular release of beta-cleaved 
soluble APP, and a corresponding cell-associated carboxy-terminal 
fragment which is later release by gamma-secretase . 

CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 
Val-Asn-Leu-|-Asp-Ala-Glu-Phe in the Swedish variant of 
CC Alzheimer's amyloid precursor protein. 

CC -f- SUBCELLULAR LOCATION: Type I membrane protein 
CC ALTERNATIVE PRODUCTS: 

CC Event-Alternative splicing; Named isoforms=4; 

CC Name=A; Synonyms=BACE-lA, BAC-501; 

CC IsoId=P56817-l; Sequence=Displayed; 

CC Name=B; Synonyms =BACE- IB, BACE-I-476; 



RT 
RT 
RL 



CC 
CC 
CC 
CC 
CC 
CC 
CC 



cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

cc 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 



IsoId=P56817-2; Sequence=VSP 005223; 
Name=C; Synonyms=BACE-lC, BACE-I-4 57; ' 

IsoId=P56817-3; Sequence=VSP 005222; 
Name=D; Synonyms=BACE-lD, BACE-I-432 ; 

IsoId=P56817-4 ; Sequence=VSP 005222 
TISSUE SPECIFICITY: Brain. 
SIMILARITY; Belongs to peptidase family Al . 



VSP_005223; 



baleen t h l Z 7 Ifc is Produced through a collaboration 

between the wiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bicmformatics Institute. There are no restriction on its 
h J**, n °r P ^° fit institutions as long as its content is in no way 
enSSes Tau at f^ at ±S not — "«ge by and for o- rc 

or S arSa^'to ^ S \*^™^ (See http://www.isb-sib.ch/announce/ 

or sena an email to license@isb-sib . ch) 

EMBL; AF190725; AAF04142.1 

EMBL; AF201468; AAF18982.1 

EMBL; AF200343; AAF17079.1 

EMBL; AF2 04 943; AAF2 63 67.1 

EMBL; AF338816; AAK38374.1 

EMBL; AF338817; AAK38375.li 

EMBL; AB050436; BAB40931.li 

EMBL; AB050437; BAB40932.l' 

EMBL; AB050438; BAB40933.1, 

EMBL; AF200193; AAF13715.1; 

PIR; A59090; A59090. 

PDB; 1M4H; 28-AUG-02. 
MEROPS; A01.004; 

Genew; HGNC:933; BACE. 
MIM; 604252; -. 

GO; GO: 0005887; C:integral to plasma membrane; TAS 
GO; GO: 0008798; F: be t a -a spa rtyl -peptidase activity; TAS 
GO; GO: 0009405; P : pathogenesis ; TAS. 

GO; GO: 0006508; P : proteolysis and peptidolysis ; TAS 
InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR009007; Pept_A_acid. 
InterPro; IPR001461; Peptidase_Al . 
Pfam; PF00026; asp; 1. 
PRINTS; PR00792; PEPSIN. 
PROSITE; PS00141; ASP_PROTEASE; 1. 

Hydrolase; ^Aspartyl protease; Glycoprotein; Zymogen; Transmembrane; 

Lcmg; 3D-structure . 
POTENTIAL. 

BETA-SECRETASE . 
EXTRACELLULAR (POTENTIAL) . 
POTENTIAL . 

CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
BY SIMILARITY. 



FT 


SIGNAL 


1 


21 


FT 


PROPEP 


22 


45 


FT 


CHAIN 


46 


501 


FT 


DOMAIN 


22 


457 


FT 


TRANSMEM 


458 


478 


FT 


DOMAIN 


479 


501 


FT 


ACT SITE 


93 


93 


FT 


ACT_SITE 


289 


289 


FT 


DISULFID 


216 


420 


FT 


DISULFID 


278 


443 


FT 


DISULFID 


330 


380 


FT 


CARBOHYD 


153 


153 


FT 


CARBOHYD 


172 


172 


FT 


CARBOHYD 


223 


223 



N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



FT CARBOHYD 354 354 N-LINKED ( GLCNAC . . .) (POTENTIAL) 

FT VARSPLIC 146 189 Missing (in isoform C and isoform D) . 

FT /FTId=VSP_005222 . 

FT VARSPLIC 190 214 Missing (in isoform B and isoform D) . 

FT /FTId=VSP_005223 . 

SQ SEQUENCE 501 AA; 55763 MW; 377CE4C82 4ACEF05 CRC64; 

Query Match 53.6%; Score 1132; DB 1; Length 501; 

Best Local Similarity 54.0%; Pred. No. 8.5e-79; 

Matches 209; Conservative 65; Mismatches 109; Indels 4; Gaps 2- 



Qy 


9 


PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYF 
1 : 1 : M 1 1 1 : 1 1 1 : | | | : | | : | : | | | | | | | | | | | | | | , , , , . . , . 

PGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYY 


Db 


54 


Qy 


69 


DTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG 
: 1 1 ' 1 M 1 1 : I I | | : | | | | : 1 1 I | | | : | | : | 

QRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFING 


Db 


114 


Qy 


129 


IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS GTNGG 

1 MINIM : 1 : 1 III Mill! | : : If : | | : | : | | | | | : | : | | 
*-<.iii r» j. ±j\j ±±r\. j. /^Cj _L.fT_r\i£r LJLJ o XjIj t n £ L, VKQ rnVPNLFS LQLCGAGFPLNQ S EVLAS VGG 


Db 


174 


Qy 


186 


SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTT 
1 = = • 1 M : 1 1 1 1 : 1 1 1 1 1 : MM:: | : : : | | | | | : | | : | | | 1 I : | | 1 1 1 M 
SMI I GGIDHS LYTGS LWYT P I RREWYYEVI I VRVEI NGQDLKMDCKEYN YDKS IVDSGTT 


Db 


234 


Qy 


246 


LLRLPQWFDAVVEAVARASLIPEFSDGF^TGSQLACWTNSETPWSYFPKISIYLRDENS 
1 1 1 1 : 1 1 1 : 1 1 : : : | | : 1 ! 1 II I | | M 1 1 1 : 1 1 1 I : I I I : 
NLRL P KKVFEAAVKS I KAAS S TEKFP DGFWLGEQLVCWQAGTT PWN I FP VI S L YLMGEVT 


Db 


294 


Qy 


306 


SRSFRITILPQLYIQPMMGAGLNY-ECYRFGI SPSTNALVIGATVMEGFYVIFDRAQKRV 
: : 1 1 1 1 1 1 1 M 1 = : 1 : : : 1 1 : I II I : MM : 1 M M M I M M 1 M 

NQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRI 


Db 


354 


Qy 


365 


GFAASPCAEIAGAAVSEISGPFSTEDV 391 


Db 


414 


1 1 1 1 1 : : MM |: 
GFAVSACHVHDEFRTAAVEGPFVTLDM 44 0 



RESULT 5 
PEPC_MACFU 

ID PEPC_MACFU STANDARD; PRT; 377 AA 

AC P03955; 

DT 23-OCT-1986 (Rel. 02, Created) 

01-AUG-1992 (Rel. 23, Last sequence update) 
28-FEB-2003 (Rel. 41, Last annotation update) 

Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) (Fragment). 



DT 
DT 
DE 

GN PGC . 

OS 
OC 



Macaca fuscata fuscata (Japanese macaque) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 
OC Cercopithecinae; Macaca. 
OX NCBI_TaxID=954 3; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Gastric mucosa; 



RX MEDLINE=92 037 64 5; PubMed^l 935977 ; 
RA Kageyama T . , Tanabe K., Koiwai O.; 

RT "Development-dependent expression of isozymogens of monkey 
RT pepsinogens and structural differences between them."; 
RL Eur. J. Biochem. 202:205-215(1991). 
RN [2] 

RP SEQUENCE OF 6-377. 

RX MEDLINE=8 6168133; PubMed=3514597 ; 
RA Kageyama T., Takahashi K. ; 

RT "The complete amino acid sequence of monkey progastricsin . " ; 
RL J. Biol. Chem. 261:44 06-4419(1986). 
RN [3] 

RP SEQUENCE OF 6-65. 

RX MEDLINE=85289106; PubMed=3928607 ; 

RA Kageyama T., Takahashi K. ; 

RT "Monkey pepsinogens and pepsins. VII. Analysis of the activation 

RT process and determination of the NH2-terminal 60-residue sequence of 

RT Japanese monkey progastricsin, and molecular evolution of 

RT pepsinogens . " ; 

RL J. Biochem. 97:1235-12 4 6(1985). 

CC -\~ CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 
CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

CC PTM: Each pepsinogen is converted to corresponding pepsin at pH 

CC 2.0 in part as a result of the release of a 47 aa activation 

CC segment and in part as a result of stepwise proteolytic cleavage 

cc via an intermediate form(s). 

CC -!- MISCELLANEOUS: The expression of pepsinogen genes is regulated by 

cc hormones and related substances. 

CC -!- SIMILARITY: Belongs to peptidase family Al. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; X59754; CAA42426.1; -. 

DR PIR; S19683; PEMQCJ. 

DR HSSP; P20142; 1AVF. 

DR MEROPS; A01.003; 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Zymogen; Digestion; Signal 

FT NON_TER 11 

FT SIGNAL <1 5 

FT PROPEP 6 31 ACTIVATION PEPTIDE. 

FT PROPEP 32 4 8 ACTIVATION PEPTIDE 

FT CHAIN 49 377 GASTRICSIN. 

FT DISULFID 93 98 

FT DISULFID 256 260 



FT DISULFID 299 332 

FT ACT_SITE 80 80 

FT ACT_SITE 2 65 2 65 

FT CONFLICT 331 331 Y -> V (IN REF 2) 

FT CONFLICT 349 349 L -> LVY (IN REF. 2). 

SQ SEQUENCE 377 AA; 41148 MW; 2CFB8 F8BF2 6D77CE CRC64; 

Query Match 17.0%; Score 359.5; DB 1; Length 377; 

Best Local Similarity 30.1%; Pred. No. 4.1e-20; 

Matches 112; Conservative 60; Mismatches 111; Indels 89; Gaps 17; 

30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHS YIDTYFDTERSSTYRSKGF 81 

' : ' : I I I I I I : | | | | | | | | || ( , | . | | | | • | 

Db 62 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHS RFNPSES STYSTNGQ 117 

Q y 82 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

: : : I M I I I I I : | : | | (I) | , . . . , | . , 

Db 118 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 170 

QY 137 LAYATLAKP S S S LET FFDSLVTQANI PN- VFSMQMCGAGLPVAGS GTNGGS LVLGGI EP S 195 

Mill: : : I : I : : : : | | : : I : : I I : ■ I I I * * I 

Db 171 LAYPTLSVDGAT— TAMQGMVQEGALTSPI FSVYLSDQ QGSSGGAWFGGVDSS 222 

Qy 196 LYKGDIWYTp IKEEWYYQIEILKLEIGGQSLNLDCREYN7VDKAIVDSGTTLLRLPQKVFD 255 

M I I : : I : : | | : | | | : | | | | : | ( : I I I I : || : | ( : || : 

Db 223 L YTGQI YWAPVTQELYWQIGIEEFLIGGQASGW-CSE — GCQAIVDTGTSLLTVPQQYMS 279 

Q y 256 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKI S I YLRDENS 305 

M : : I I I : | I : I I : : : 
Db 280 ALLQA TGAQ EDEYGQFLVNCNSIQNLPTLTFII 312 

Q y 306 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP S TNAL VI GAT VME G FYVI F 357 

: 1 M M | | : | | : : | . . | . . 

Db 313 "NGVEFPLPPSSYI LNNNGYCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVY 364 

QY 358 DRAQKRVGFAAS 369 

I : Mill: 
Db 365 DLSNNRVGFATA 376 



RESULT 6 
PEPC_CALJA 

ID PEPC_CALJA STANDARD; PRT; 388 AA. 

AC Q9N2D3; 

DT 16-OCT-2001 (Rel. 40, Created) 
DT 16-OCT-2001 (Rel. 40, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 

Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 



DE 

GN PGC 



OS Callithrix jacchus (Common marmoset) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Platyrrhini; Callitrichidae ; 

OC Callithrix. 

OX NCBI_TaxID=94 83; 

RN [1] 

RP SEQUENCE FROM N.A., SEQUENCE OF 17-26, FUNCTION, AND ENZYME 

RP REGULATION. 



RC 

RX 

RA 

RT 

RT 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



TISSUE=Gastric mucosa; 

MEDLINE=2 02 508 34; PubMed=l 07 8 8 7 8 4 ; 

Kageyama T. ; 

"New World monkey pepsinogens A and C, and prochymosins . Purification 
characterization of enzymatic properties, cDNA cloning, and molecular' 
evolution. " ; 

J. Biochem. 127:7 61-77 0(2 000). 

FUNCTION: Hydrolyzes a variety of proteins. 

CATALYTIC ACTIVITY: More restricted specificity than pepsin A but 
shows preferential cleavage at Tyr- | -Xaa bonds; high activity' 
towards hemoglobin as substrate. 
ENZYME REGULATION: Inhibited by pepstatin. 
MISCELLANEOUS: The optimal pH is around 2. 
SIMILARITY: Belongs to peptidase family Al . 



- ! - 

- t _ 



a collaboration 
and the EMBL outstation - 
There are no restrictions on its 



This SWISS-PROT entry is copyright. It is produced through 
between the Swiss Institute of Bioinf ormatics 
the European Bioinf ormatics Institute. 

use by non-profit institutions as long as its content is m no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; AB038385; BAA90872.1; 
PIR; JC7246; JC7246. 
HSSP; P2 0142; 1AVF. 
MEROPS; A01.003; -. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Zymogen; Digestion; 



SIGNAL 
PROPEP 
CHAIN 
ACT_SITE 
ACT_SITE 
DISULFID 
DISULFID 
DISULFID 
SEQUENCE 



1 
17 
60 
91 
276 
104 
267 
310 



16 
59 
388 
91 
276 
109 
271 
343 



Signal . 



ACTIVATION PEPTIDE (BY SIMILARITY) . 

GASTRICSIN. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 



388 AA; 42503 MW; 0BC4 8DBD1F7D2D8C CRC64; 



Query Match 16.6%; Score 351.5; DB 1; Length 388; 

Best Local Similarity 30.1%; Pred. No. 1.7e-19; 
Matches 112; Conservative 56; Mismatches 115; Indels 



Qy 

Db 

Qy 

Db 



89; Gaps 17; 

30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHS YI DTYFDTERSSTYRSKGF 81 

' : ' : I I I I I I : I | | M | | | (I,, , . |M|,, 

73 YFGEIS IGTPPQNFLVLFDTGS SNLWVPSVYCQSQACTSHS RFNPSASSTYSSNGQ 128 

82 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

: : : I I I I I I I I : I : | | M I II • • M I * I 

129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN— EPGTNFVYAQFDGIMG 181 



Qy 



137 LAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPS 195 



Ill I : : : I : : : : : | | | : | : : | | : : : | | : : | 

Db 182 LAY PAL SMGGAT TAMQGMLQEGALT S P VFS FYL SNQ Q G S S GGAVI FGGVD S S 233 

Qy 196 LYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 255 

M I I : : I : : | | : | | | : | | | | : | | . , , , , . , , . , , . . 

Db 234 LY TGQI YWAPVTQELYWQIGIEEFLIGGQASGW-CSE — GCQAIVDTGTSLLTVPQQYMS 290 

Qy 256 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKISI YLRDENS 305 

I : I I | | : | | : | I : : : 
Db 291 AFLEA TGAQ EDEYGQFLVNCDSIQNLPTLTFI I 323 

Q y 306 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP STNALVI GATVMEGFYVI F 357 

: 1 I I I : I I : I I : : | : : | • | 

Db 324 "NGVEFPLPPSSYI LSNNGYCTVGVEPTYLS SQNSQPLWILGDVFLRSYYSVF 375 

QY 358 D RAQ KRVG FAAS 3 69 

I Mill: 
Db 37 6 DLGNNRVGFATA 387 



DT 
DT 



RESULT 7 
PEPC_HUMAN 

ID PEPC_HUMAN STANDARD; PRT ; 388 AA 

AC P20142; 

01-FEB-1991 (Rel. 17, Created) 
01-FEB-1991 (Rel. 17, Last sequence update) 
DT 28-FEB-2003 (Rel. 41, Last annotation update) 
DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 
GN PGC . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9 606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88087276; PubMed=333554 9 ; 

RA Hayano T w Sogawa K., Ichihara Y. , Fu j ii-Kuriyama Y., Takahashi K. ; 

RT "Primary structure of human pepsinogen C gene."; 

RL J. Biol. Chem. 2 63:1382-13 8 5(198 8). 
RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89079679; PubMed=2909526; 

RA Taggart R.T., Cass L.G., Mohandas T.K., Derby P., Barr P.J., Pals G 

RA Bell G.I.; 

RT "Human pepsinogen C (progastricsin) . Isolation of cDNA clones, 

RT localization to chromosome 6, and sequence homology with pepsinogen 

RT • ; 

RL J. Biol. Chem. 2 64:375-37 9(198 9). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=8 92 9084 0; PubMed=256769 7 ; 

RA Pals G., Azuma T., Mohandas T.K., Bell G.I., Bacon J., 

RA Samloff I.M., Walz D . A. f Barr P . J. , Taggart R.T.; 

RT "Human pepsinogen C (progastricsin) polymorphism: evidence for a 

RT single locus located at 6p2 1 . 1-pter . " ; 

RL Genomics 4:137-148(1989). 



RN [4] 

RP SEQUENCE FROM N . A. 
RA Wong R.N.S., Tang J.; 

RL Submitted (NOV-1996) to the EMBL/ GenBank/DDBJ databases 
RN [5] 

RP SEQUENCE OF 17-101. 

RX MEDLINE=9 0 13 04 02; PubMed=2 51519 3; 

RA Athauda S.B.P., Tanji M. , Kageyama T., Takahashi K.; 
RT "A comparative study on the NH2-terminal amino acid sequences and 
RT some other properties of six isozymic forms of human pepsinogens and 
RT pepsins . " ; 

RL J. Biochem. 106:920-927(1989). 
RN [6] 

RP SEQUENCE OF 17-64. 

RX MEDLINE=8307 9318; PubMed=68 16595 ; 
RA Foltmann B., Jensen A.L.; 

RT "Human progas tricsin . Analysis of intermediates during activation 
RT into gastricsin and determination of the amino acid sequence of the 

RT propart . "; 

RL Eur. J. Biochem. 12 8:63-7 0(1982). 
RN [7] 

RP X-RAY CRYSTALLOGRAPHY (1.62 ANGSTROMS). 

RX MEDLINE=9 52 3 0 687; PubMed=7 714902; 

RA Moore S.A., Sielecki A.R., Chernaia M.M. , Tarasova N.I., James M.N.G.; 

RT "Crystal and molecular structures of human progastricsin at 1 62-A 

RT resolution."; 

RL J. Mol. Biol. 247:466-485(1995). 
RN [8] 

RP X-RAY CRYSTALLOGRAPHY (2.36 ANGSTROMS). 

RX MEDLINE=98 06964 9; PubMed=94 06551 ; 

RA Khan A.R., Cherney M.M. , Tarasova N.I., James M.N.; 

RT "Structural characterization of activation 'intermediate 2' on the 

RT pathway to human gastricsin."; 

RL Nat. Struct. Biol. 4:1010-1015(1997). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 

CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

cc towards hemoglobin as substrate. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M18667; AAA60062.1; ALT_INIT . 

DR EMBL; M18659; AAA60062.1; JOINED. 

DR EMBL; M18660; AAA60062.1; JOINED. 

DR EMBL; M18661; AAA60062.1; JOINED. 

DR EMBL; M18662; AAA60062.1; JOINED. 

DR EMBL; M18663; AAA60062.1; JOINED. 

DR EMBL; M18664; AAA60062.1; JOINED. 

DR EMBL; M18665; AAA60062.1; JOINED. 

DR EMBL; M18666; AAA60062.1; JOINED. 

DR EMBL; M23077; AAA60063.1; -. 



DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 



EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL ; 
EMBL; 



M23069; 
M23070; 
M23071; 
M23072; 
M23073; 
M23074; 
M23075; 
J04443; 
U75272; 



JOINED. 
JOINED. 
JOINED. 
JOINED. 
JOINED. 
JOINED. 
JOINED. 



AAA60063. 1 
AAA60063. 1 
AAA60063. 1 
AAA60063 . 1 
AAA60063 . 1 
AAA60063. 1 
AAA60063. 1 
AAA60074 . 1 
AAB18273.1; -. 
PIR; A29937; A29937. 
PDB; 1HTR; 26-JAN-95. 
PDB; 1AVF; 25-FEB-98. 
MEROPS; A01.003; -. 
Genew; HGNC:8 890; PGC. 
MIM; 169740; 

GO; GO: 0005615; C : extracellular space; TAS . 

GO; GO: 0004190; F: aspartic-type endopeptidase activity; TAS 
GO; GO: 0007586; P:digestion; TAS. 

GO; GO:0006508; P : proteolysis and peptidolysis; TAS. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROTEASE; 2. 



DR 


PROSITE; 


PS00141; 


ASP 


KW 


Hydrolase 


; Aspartyl pr 


KW 


3D-structure . 




FT 


SIGNAL 


1 


16 


FT 


PROPEP 


17 


59 


FT 


CHAIN 


60 


388 


FT 


ACT_SITE 


91 


91 


FT 


ACT SITE 


276 


276 


FT 


DISULFID 


104 


109 


FT 


DISULFID 


267 


271 


FT 


DISULFID 


310 


343 


FT 


CONFLICT 


40 


41 


FT 


CONFLICT 


52 


52 


FT 


STRAND 


19 


25 


FT 


HELIX 


29 


35 


FT 


TURN 


36 


37 


FT 


HELIX 


39 


43 


FT 


TURN 


44 


45 


FT 


HELIX 


50 


54 


FT 


HELIX 


65 


68 


FT 


TURN 


69 


70 


FT 


STRAND 


73 


79 


FT 


TURN 


80 


83 


FT 


STRAND 


84 


91 


FT 


TURN 


92 


93 


FT 


STRAND 


97 


101 


FT 


TURN 


102 


103 


FT 


HELIX 


107 


110 


FT 


TURN 


111 


111 


FT 


STRAND 


115 


115 


FT 


HELIX 


117 


119 


FT 


TURN 


121 


122 


FT 


STRAND 


124 


134 



ACTIVATION 
GASTRICSIN 



PEPTIDE 



GE -> ED (IN REF. 6) 
W -> S (IN REF. 6) . 



FT TURN 135 136 

FT STRAND 137 150 

FT TURN 151 152 

FT STRAND 153 163 

FT HELIX 169 173 

FT STRAND 178 181 

FT TURN 190 191 

FT HELIX 195 201 

FT TURN 202 203 

FT STRAND 209 214 

FT STRAND 221 227 

FT HELIX 232 234 

FT STRAND 235 244 

FT STRAND 251 254 

FT STRAND 256 259 

FT TURN 2 60 2 61 

FT STRAND 2 62 2 63 

FT TURN 266 269 

FT STRAND 271 275 

FT TURN 277 278 

FT STRAND 282 285 

FT HELIX 286 288 

FT HELIX 289 296 

FT TURN 297 297 

FT STRAND 299 300 

FT TURN 302 303 

FT STRAND 306 308 

FT HELIX 310 315 

FT STRAND 319 323 

FT TURN 324 325 

FT STRAND 326 330 

FT HELIX 332 335 

FT STRAND 336 338 

FT STRAND 343 345 

FT STRAND 347 350 

FT TURN 355 356 

FT STRAND 360 363 

FT HELIX 365 368 

FT TURN 369 370 

FT STRAND 371 376 

FT TURN 377 380 

FT STRAND 381 388 

SQ SEQUENCE 388 AA; 42426 MW; F8 62DFDC1438BB92 CRC64; 

Query Match 16.6%; Score 350.5; DB 1; Length 38 8; 

Best Local Similarity 30.1%; Pred. No. 2.1e-19; 

Matches 112; Conservative 59; Mismatches 112; Indels 89; Gaps 18; 

30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHS YIDTYFDTERSSTYRSKGF 81 

I : I : I I ! I I I : I f | | f | f I I I I I | : || I I * I 

Db 73 YFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHS RFNPSES STYSTNGQ 128 

Q ^ 82 DV TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

: = i I I I M I I : I : | | | | | | | • • • | I " I 

Db 129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN— EPGTNFVYAQFDGIMG 181 



137 LAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPS 



195 



Db 


182 


Qy 


196 


Db 


234 


Qy 


256 


Db 


291 


Ov 


306 


Db 


324 


Qy 


358 


Db 


376 



111: : I : : | | : : | | | 



«- ■ — — ~ — >c ^ j-ii. _l j.^^Tj_/i.vr-i.-L vu jui x -LjJ-jxNj_iir y r\ 

I I I I : : I : : I I : II I : I I I I : | | : I I I I : I I : | | : | | : 



rTGSQLACWTNSETPWSYF PKISIYLRDENS 305 

I I : I I : I | : : : 
TGAQ EDEYGQFLVNCNSIQNLPSLTFII 323 



1 I I I : I I : I II : : I : : | : : 

PSSYI LSNNGYCTVGVEPT YLS SQNGQPLWI LGDVFLRS YYS VY 375 



I I I I I 



RESULT 8 
PEPC_CAVPO 

ID PEPC_CAVPO STANDARD; PRT; 394 AA. 

AC Q64411; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) 

GN PGC . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Hys tricognathi ; Caviidae; Cavia 
OX NCBI_TaxID=10141; 
RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=92355614; PubMed=164 4 82 9 ; 

RA Kageyama T., Ichinose M. , Tsukada S., Miki K. , Kurokawa K. , Koiwai O., 
Tanji M. , Yakabe E. , Athauda S.B., Takahashi K. ; 
"Gastric procathepsin E and progastricsin from guinea pig. 
Purification, molecular cloning of cDNAs, and characterization of 
enzymatic properties, with special reference to procathepsin E."; 
RL J. Biol. Chem. 267:16450-16459(1992). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 
CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

cc towards hemoglobin as substrate. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 



RA 
RT 
RT 
RT 



CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; M88652; AAA37053.1; -. 

DR PIR; B43356; B43356. 



DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



HSSP; P20142; 1AVF. 
MEROPS; A01.003; -. 

InterPro; IPR001969; Aspprotease__AS. 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Zymogen; Digestion; Signal 



SIGNAL 
PROPEP 
CHAIN 
ACT_SITE 
ACT_SITE 
DISULFID 
DISULFID 
DISULFID 
SEQUENCE 



1 16 POTENTIAL. 

17 65 ACTIVATION PEPTIDE. 

66 394 GASTRICSIN. 

97 97 BY SIMILARITY. 

283 283 BY SIMILARITY. 

HO H5 BY SIMILARITY. 

273 277 BY SIMILARITY. 

316 349 BY SIMILARITY. 
394 AA; 42995 MW; 114F08E105D49865 CRC64; 



Query Match 15.4%; Score 324.5; DB 1; Length 394; 

Best Local Similarity 29.0%; Pred. No. 2e-17; 

Matches 107; Conservative 63; Mismatches 116; Indels 83; 



Gaps 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



3 0 YYLEMLI GTPPQKLQI LVDTGS SNF AVAGTPHS YIDTYFDTERSSTYRSKGF 

I : : : : I I I I I I : I I I | | | | : : I I I | | : I | | | • 

79 YFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTH TRFNPRDSSTYVATDQ 



IK 



82 DVTVKYTQGSWTGFVGEDLVTI PK-GFNTSFLVNIATI FESENFFLPG 

: : : I I I I I I I : I I I I I I | : | || : 

135 SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQEFGLS ETE PGSDFVYAE 



131 WNGI LGLAYATLAKPS S SLET FFDS LVTQANI - PNVFSMQMCGAGLPVAGS — GTNGGS L 
-MINI I : : : : | | : : : : : | | : : | | | : • | | 

182 FDGILGLGYPGLSEGGAT TAMQGLLREGALSQSLFSVYL GSQQGSDEGQL 

188 VLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLL 
H I I : : I I I I | I : : I I : : | | : | | | | | : | : , j , . , , . , , 

232 ILGGVDESLYTGDI YWT P VTQELYWQIGIEGFLIDGSAS GWCSR GCQGIVDTGTSLL 

248 RLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKI SI YLRDENSSR 

•'I : I : I : | : : | : : | : : | | : 
289 TVPSDYLSTLVQAI GAEE NEYGEYF VSCSSIQDLPTLTFVI SGV 



308 SFRITILPQLYIQP MMGAGLNYECYRFGI S PSTN — ALVI GATVMEGFYVI FDRA 

: ' 11 1:1 I =11 : : t : : I : : | | 

333 EFPLSPSAYILSGENYCMVGLESTY VSPGGGEPVWILGDVFLRSYYSVYDLA 

3 61 QKRVGFAAS 3 69 

I 1 I II : 
385 NNRVGFATA 393 



18; 
81 
134 
130 
181 
187 

231 

247 

288 

307 

332 

360 

384 



RESULT 9 
PEPC_RAT 

ID PEPC_RAT STANDARD; PRT; 392 AA 

AC P04073; 

DT 01-NOV-1986 (Rel. 03, Created) 



DT 01-NOV-1986 (Rel. 03, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 

GN PGC . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC ST RAI N=Wi star; 

RX MEDLINE=89255508; PubMed=27228 63 ; 

RA Ishihara T., Ichihara Y. , Hayano T . , Katsura I., Sogawa K. , 

RA Fujii-Kuriyama Y. , Takahashi K. ; 

RT "Primary structure and transcriptional regulation of rat pepsinoaen C 

RT gene . " ; ^ 

RL J. Biol. Chem. 264:10193-10199(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; 

RX MEDLINE=87 054 02 0; PubMed=37 8 074 1 ; 

RA Ichihara Y. , Sogawa K. , Morohashi K. , Fu j ii-Kuriyama Y. , Takahashi K. ; 

RT "Nucleotide sequence of a nearly full-length cDNA coding for 

RT pepsinogen of rat gastric mucosa."; 

RL Eur. J. Biochem. 161:7-12(1986). 

RN [3] 

RP SEQUENCE OF 16-112. 

RC STRAIN=Wistar; 

RX MEDLINE=84257 697; PubMed=674 367 0 ; 

RA Arai K.M. , Muto N., Tani S., Akahane K. ; 

RT "The N-terminal sequence of rat pepsinogen."; 

RL Biochim. Biophys. Acta 7 8 8:256-261(1984). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 

CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

cc towards hemoglobin as substrate. 

CC SIMILARITY : Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M25993; AAA41827.1; 

DR EMBL; M25985; AAA41827.1; JOINED. 

DR EMBL; M25986; AAA41827.1; JOINED. 

DR EMBL; M25987; AAA41827.1; JOINED. 

DR EMBL; M25988; AAA41827.1; JOINED. 

DR EMBL; M25989; AAA41827.1; JOINED. 

DR EMBL; M25990; AAA41827.1; JOINED. 

DR EMBL; M25991; AAA41827.1; JOINED. 

DR EMBL; M25992; AAA41827.1; JOINED. 

DR EMBL; X04644; CAA28305.1; -. 

DR PIR; A33510; A24608. 

DR HSSP; P20142; 1AVF . 



DR 


MEROPS; 


A01. 003; 


* 






DR 


InterPro 


; IPR001969; Aspprotease AS. 




DR 


InterPro 


; IPR009007; Pept A 


acid. 




DR 


InterPro 


; IPR001461; Peptidase Al . 




DR 


Pfam; PF00026; asp; 1. 






DR 


PRINTS; 


PR00792; 


PEPSIN. 






DR 


PROSITE; 


PS00141; 


ASP PROTEASE; 2. 




KW 


Hydrolase; Aspartyl protease 


; Zymogen; Digestion 


FT 


SIGNAL 


1 


16 






FT 


PROPEP 


17 


62 


ACTIVATION PEPTIDE. 


FT 


CHAIN 


63 


392 


GASTRICSIN. 




FT 


ACT_SITE 


94 


94 






FT 


ACT_SITE 


280 


280 






FT 


DISULFID 


107 


112 


BY SIMILARITY. 




FT 


DISULFID 


270 


275 


BY SIMILARITY. 




T~im 

FT 


DISULFID 


314 


347 


BY SIMILARITY. 




FT 


CONFLICT 


31 


31 


E -> Q (IN REF. 


3) . 


FT 


CONFLICT 


103 


103 


S -> A (IN REF. 


3) . 


FT 


CONFLICT 


109 


109 


S -> L (IN REF. 


3) . 


SQ 


SEQUENCE 


392 AA, 


42833 MW; 


092A5EAF2783EDD1 C 


Query Match 




14. 8%; 


Score 313; DB 


1; I 



Best Local Similarity 29.5%; Pred. No. 1.5e-16; 

Matches 105; Conservative 56; Mismatches 139; Indels 



56; Gaps 



16; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



30 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGF 81 

n 1=1:111111 : I I f||: | : : I | || : : I 

76 YFGEI S I GTPPQNFLVLFDTGS SNLWVS SVYCQSEACTTHA RFNPSKS STYYTEGQ 131 

82 DVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATIFESENFFLPG IKWNGILG 136 

:::| II III i I :|: | | |,| |, :::||:| 

132 TFSLQYGTGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 184 

137 LAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS — GTNGGSLVLGGIEP 194 

IN I : I I : : : | : j | | | | : | | | : | | | • • 
185 LAYPGLS — SGGATTALQGMLGE GALSQPLFGVYL GSQQGSNGGQIVFGGVDK 235 

195 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 254 

„ * M |:| : |: :| |: I: I : I I I : I I : I I : I : 

236 NLYTGEITWVP VTQEL YWQIT I DDFLI GDQASGW-CS SQGC-QGI VDTGTSLLVMPAQYL 293 

255 DAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 314 

: ' : : 1 : : I : I I :| I : : 

294 SELLQTIGAQE GEYGEYF VSCDSVSS LPTLSFVL NGVQFPLS 335 

315 PQLY-IQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAAS 369 

I I I I : I : : I : : I I f f : I I I I 

336 PSSYIIQEDNFCMVGLESISLTSESGQPLWILGDVFLRSYYAI FDMGNNKVGLATS 391 



RESULT 10 
CAT D_C LUHA 

ID CAT D_C LUHA STANDARD; PRT; 3 96 AA 

AC Q9DEX3 ; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 {Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 



DE Cathepsin D precursor (EC 3.4.23.5). 
OS Clupea harengus (Atlantic herring) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Actinopterygii; Neopterygii; Teleostei; Clupeomorpha ; Clupeidae • 
OC Clupea. 

OX NCBI_TaxID=7 95 0; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Nielsen L.B., Stougaard P., Andersen P.S., Pedersen L.H.; 

RT "Cloning and sequence determination of herring muscle cathepsin D 

RL Submitted (OCT-2000) to the EMBL/ GenBank/DDB J databases 

RN [2] 

RP SEQUENCE OF 62-82. 

RC TISSUE=Skeletal muscle; 

RX MEDLINE=211654 69; PubMed=11207447 ; 

RA Nielsen L.B., Nielsen H.H.; 

RT "Purification and characterization of cathepsin D from herring muscle 
RT (Clupea harengus)."; 

RL Comp. Biochem. Physiol. 12 8B : 351-363 ( 2 001 ) . 

CC -!- FUNCTION: Cathepsin D is an acid protease active in intracellular 
CC protein breakdown. 

CC -!- CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
cc that of pepsin A. Does not cleave the 4-Gln- | -His-5 bond in B 

CC chain of insulin. 

CC -!- ENZYME REGULATION: Inhibited by pepstatin. 
CC -!- SUBUNIT: Monomer. 

CC SUBCELLULAR LOCATION: Lysosomal. 

CC -!- MISCELLANEOUS: The isoelectric point is 6.8. Has optimal activity 

cc at pH 2.5 with hemoglobin as the substrate and the optimal 

CC temperature is 37 degrees Celsius. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; AF312364; AAG27733.1; -. 

DR HSSP; P07339; 1LYB. 

DR MEROPS; A01.009; -. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Glycoprotein; Lysosome; Signal; Zvmoqen 

FT SIGNAL 1 18 POTENTIAL. 

FT PROPEP 19 61 ACTIVATION PEPTIDE. 

FT CHAIN 62 39 6 CATHEPSIN D. 

FT ACT_SITE 94 94 BY SIMILARITY. 

FT ACT_SITE 281 281 BY SIMILARITY. 

FT DISULFID 107 114 BY SIMILARITY. 

FT DISULFID 272 276 BY SIMILARITY. 



FT DISULFID 315 352 BY SIMILARITY . 

FT CARBOHYD 131 131 N-LINKED (GLCNAC. . . ) ( POTENTIAL) 

SQ SEQUENCE 396 AA; 43315 MW; D0375DC3 8567A31B CRC64; 

Query Match 14.7%; Score 311; DB 1; Length 396; 

Best Local Similarity 28.3%; Pred. No. 2.2e-16; 

Matches 105; Conservative 59; Mismatches 125; Indels 82; Gaps 16; 

Qy 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 83 

" I : : I I I I : : I I f I I | : | : | : . . , | , , , . 

Db 76 YYGEI GLGTPVQMFTWFDTGS SNLWLPS IHCS FTDIACLLHHKYNGAKS ST YVKNGTEF 135 

Q y 84 TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGI KWNGILGLA 138 

: : I I I : I : : : I | | : | : | | j | : I : * I I I I * I 

Db 136 AIQYGSGSLSGYLSQDSCTI GDIWEKQLF-GEAI KQPGVAFIAAKFDGILGMA 188 

Qy 139 YATLAKPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGSGTN GGSLVLGGI 192 

1 : : : I I : : : I : | | | | : | I I | : | | | 

Db 189 YPRIS VDGVPPVFDMMMSQKKVEQNVFSFYL NRNPDTEPGGELLLGGT 236 

Q y 193 EPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQK 252 

: 1 1 I ' I I : : I : I I : : I I I I | | : : : I I I I : I I : | : | 

237 DPKYYTGDFNYVPVTRQAYWQIHMDGMSIGSQ-LTL-CKD — GCEAIVDTGTSLITGPPA 292 

Qy 253 VFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI SI 298 

I : : I * M I I | | : I I I | 
Db 293 EVRALQKAIGAIPLIQGEYMIDCKKVPTLPTIS — FNVGGK TYSLTGEQY 340 

Q ^ 299 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVI FD 358 

I ■ • : : - I III | : • • | • . | . | j 

Db 341 VLKESQGGKTICLSGLMGLEIPP PAGPLWILGDVFIGQYYTVFD 384 

Qy 359 RAQKRVGFAAS 369 

I MINI 
Db 385 RESNRVGFAKS 3 95 



Db 



DT 
DT 



RESULT 11 
PEPE_CHICK 

ID PEPE_CHICK STANDARD; PRT; 383 AA 

AC P16476; 

DT 01-AUG-1990 (Rel. 15, Created) 

01-AUG-1990 (Rel. 15, Last sequence update) 
28-FEB-2003 (Rel. 41, Last annotation update) 
DE Embryonic pepsinogen precursor (EC 3.4.23.-). 
OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=88227903; PubMed=3131317 ; 

RA Hayashi K. , Agata K., Mochii M. , Yasugi S w Eguchi G., Mizuno T.; 

RT "Molecular cloning and the nucleotide sequence of cDNA for embryonic 

RT chicken pepsinogen: phylogenetic relationship with prochymosin "; 

RL J. Biochem. 103:2 90-2 96(1988). 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



_ i _ 



_ i _ 



DEVELOPMENTAL STAGE: Specifically secreted during the embryonic 
period m the chicken proventriculus (glandular stomach) 
SIMILARITY: Belongs to peptidase family Al . 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch ) . 



EMBL; D00215; BAA00153.1; 
PIR; A41443; A41443. 
HSSP; P00794; 4 CMS . 
MEROPS; AO 1.02 8; 

InterPro; IPR001969; Aspprotease_AS. 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR0014 61; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Digestion; Signal; Glycoprotein. 



SIGNAL 
CHAIN 
ACT_SITE 
ACT_SITE 
DISULFID 
DISULFID 
DISULFID 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
VARIANT 
SEQUENCE 



1 
17 
94 
276 
107 
267 
310 
132 
204 
309 
350 
51 
383 M, 



16 
383 
94 
276 
112 
271 
344 
132 
204 
309 
350 
51 

41719 



MW; 



POTENTIAL. 
EMBRYONIC PEPSINOGEN. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
N-LINKED (GLCNAC. . . 
N-LINKED (GLCNAC. . . 
N-LINKED (GLCNAC. . . 
N-LINKED (GLCNAC. . . 
T -> S. 

1642796871611F54 



) (POTENTIAL) 

) (POTENTIAL) 

) (POTENTIAL) 

) (POTENTIAL) 



Query Match 14.5%; Score 306.5; DB 1; 

Best Local Similarity 26.9%; Pred. No. 4.6e-16; 
Matches 96; Conservative 62; Mismatches 128; 



CRC64; 



Length 3 83; 



Indels 



71; Gaps 12; 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



30 YYLEMLIGTPPQKLQILVDTGSSNFAVAG TPHSYIDTYFDTERSSTYRSKGFDVTV 85 

II : MINI :: || Mil I :| | : :||||s| | :::: 

76 YYGTISIGTPPQDFTWFDTGSSNLWVPSVSCTSPACQSHQMFNPSQSSTYKSTGQNLSI 135 

86 KYTQGSWTGFVGEDLVTI PKGFNTS FLVNI ATI FES ENFFLPGI KWNGI LGLAYATLAKP 145 
1 I I M I I I : : I : | : : | | | | : : I : : | | | | | | : | | 
136 HYGTGDMEGTVGCDTVTVAS LMDTNQLFGLST-SEPGQFFV- YVKFDGILGLGYPSLA 191 

146 SSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYT 204 

: : I I : : I : : : I : I I : : : | : I | | | : | • | | ■ 

192 ADGITPVFDNMVNESLLEQNLFSVYLSREPM GSMWFGGIDESYFTGSINWI 243 

205 PIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARA 264 

I : : I : i I : : : | : : : | | : | : | | : | : | s . | | 

244 PVSYQGYWQISMDSIIVNKQEIACS SGCQAI IDTGTSLVAGPASDINDIQSAVG — 297 



Q y 265 SLIPEFSDG ™TGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMG 324 

III I I : I : : : | 

Db 298 ANQNTYGEY SVNCSHILAMPDWF— VIG 324 

QY 325 AGLNY ECYRFGI S PSTNALVI GATVMEGFYVI FDRAQKRVGFA 367 

' : ' ' : I : : : | : : I I I I I I I I I I 

Db 325 -GIQYPVPALAYTEQNGQGTCMS S FQNS SADLWI LGDVFI RVYYS I FDRANNRVGLA 380 

RESULT 12 
CATD_MOUSE 

ID CATDJVTOUSE STANDARD; PRT; 410 AA 

AC P18242; 

DT 01-NOV-1990 (Rel. 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Cathepsin D precursor (EC 3.4.23.5). 

GN CTSD. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=91088345; PubMed=22 63503 ; 

RA Diedrich J.F., Staskus K.A. , Retzel E.F., Haase A.T.; 

RT "Nucleotide sequence of a cDNA encoding mouse cathepsin D."; 

RL Nucleic Acids Res. 18:7184-7184(1990) 

RN [2] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=90326544; PubMed=2374 732 ; 

RA GrusbyM.J., Mitchell S.C., Glimcher L.H.; 

RT "Molecular cloning of mouse cathepsin D."; 

RL Nucleic Acids Res. 18:4 0 08-4 008(1990) 

RN [3] 

RP SEQUENCE FROM N . A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=94280622; PubMed-8011168 ; 

RA Hetman M. , Perschl A., Saftig P., von Figura K . , Peters C; 

RT "Mouse cathepsin D gene: molecular organization, characterization of 

RT the promoter, and chromosomal localization."; 

RL DNA Cell Biol. 13:419-427(1994). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRA1N=C57BL/6J; TISSUE=Brain, and Mammary gland; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N K 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T E 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. * 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy s'j 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P H , 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 



RA 

RA 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 



- ! - 
_ i _ 



Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

"Generation and initial analysis of more than 15,000 full-length 

human and mouse cDNA sequences."; 

Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2 002). 

FUNCTION: Acid protease active in intracellular protein breakdown 
CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
that of pepsin A. Does not cleave the 4-Gln- | -His-5 bond in b' 
chain of insulin. 

SUBUNIT: Consists of a light chain and a heavy chain. 
SUBCELLULAR LOCATION: Lysosomal. 
SIMILARITY: Belongs to peptidase family Al . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; 


X53337; 


CAA37423. 


1, 




EMBL; 


X52886; 


CAA37067 . 


1, 




EMBL; 


X68378; 


CAA48453. 


1, 


f * 


EMBL ; 


X68379; 


CAA48453 . 


1, 


; JOINED. 


EMBL; 


X68380; 


CAA48453. 


1; 


f JOINED. 


EMBL; 


X68381; 


CAA4 8 4 53. 


1; 


' JOINED. 


EMBL; 


X68382; 


CAA48453. 


1; 


JOINED. 


EMBL ; 


X68383; 


CAA48453. 


1; 


JOINED. 


EMBL ; 


BC054758; AAH54758. 


l; -. 


EMBL; 


BC057931; AAH57931. 


1; 



PIR; 148278; KHMSD. 
HSSP; P07339; 1LYB. 
MEROPS; A01.009; -. 
MGD; MGI: 88562; Ctsd. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept__A_acid. 

InterPro; IPR0014 61; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP PROTEASE; 2. 



FT 


SIGNAL 


1 


20 


FT 


PROPEP 


21 


64 


FT 


CHAIN 


65 


410 


FT 


ACT_SITE 


97 


97 


FT 


ACT_SITE 


293 


293 


FT 


DISULFID 


91 


160 


FT 


DISULFID 


110 


117 


FT 


DISULFID 


284 


288 


FT 


DISULFID 


327 


364 


FT 


CARBOHYD 


134 


134 



POTENTIAL. 

ACTIVATION PEPTIDE 

CATHEPSIN D. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

N-LINKED (GLCNAC. , 



(POTENTIAL) 



) (BY SIMILARITY) 



FT CARBOHYD 261 261 N-LINKED (GLCNAC. . .) (BY SIMILARITY) . 

SQ SEQUENCE 410 AA; 44954 MW; DC4 92 8EC4 692 8BF0 CRC64; 

Query Match 14.5%; Score 306.5; DB 1; Length 410; 

Best Local Similarity 27.5%; Pred. No. 5e-16; 

Matches 103; Conservative 64; Mismatches 123; Indels 85; Gaps 15; 

Q y 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 83 

I I : I I I I I I :: I I I I I I I : | : : : : - | | | , , 

Db 79 YYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSF 138 

Q y 84 TVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNI AT IFESENFFLPGI KWNGIL 135 

: I I I = I : : : I I : : I : | | | | | | | | | : : | | | 

Db 139 DIHYGS GSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIF-GEATKQPGIVFVAAKFDGIL 197 

QY 136 G ^ YATL ^ PS SSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEP 194 

i : I : : : : : I I : I : I : I : I I | | | | | • | | | - 

Db 198 GMGYPHIS— VNNVLPVFDNLMQQKLVDKNIFSFY LNRDPEGQPGGELMLGGTDS 250 

Q ¥ 195 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 254 

1 |:: I : : I M : : : M : i : I I | : : I I I I : I I : | | | : 

Db 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNE-LTL-CK — GGCEAIVDTGTSLLVGPVEEV 306 

QY 255 DAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDENSSRSFRITIL 314 

: * I : I I . | | 

Db 307 KELQKAIGAVPLI QGEYMIPCEKVSSL 333 

QY 315 PQLYIQPMMGAGLNYEC YRFGIS PSTNALVI GATVMEG 352 

I : I : : : I I I I I | : | | | . . , 

Db 334 PTVYLK LG-GKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGS 390 

QY 353 FYVIFDRAQKRVGFA 3 67 

: I = M I I I I I I 
Db 391 YYTVFDRDNNRVGFA 4 05 

RESULT 13 
CATD_RAT 

ID CATD_RAT STANDARD ; PRT; 4 07 AA 

AC P24268; 

DT 01-MAR-1992 (Rel. 21, Created) 

01-MAR-1992 (Rel. 21, Last sequence update) 
28-FEB-2003 (Rel. 41, Last annotation update) 
DE Cathepsin D precursor (EC 3.4.23.5). 
GN CTSD. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus 
OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Pituitary ; 
RX MEDLINE=91057150; PubMed=224 38 02 ; 
RA Birch N.P., Loh Y.P.; 

RT "Cloning, sequence and expression of rat cathepsin D."; 
RL Nucleic Acids Res. 18:64 45-6445(19 90) 
RN [2] 



DT 

DT 



RP 

RC 

RX 

RA 

RT 

RT 

RT 

RL 

RN 

RP 

RX 

RA 

RA 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 



SEQUENCE FROM N.A. , AND SEQUENCE OF 65-74; 118-127 AND 165-174 
TISSUE=Liver; 

MEDLINE=9135424 9; PubMed=18 8 3350 ; 

Fujita H., Tanaka Y. , Noguchi Y. , Kono A. , Himeno M. , Kato K. ; 
"Isolation and sequencing of a cDNA clone encoding rat liver 
lysosomal cathepsin D and the structure of three forms of mature 
enzymes . " ; 

Biochem. Biophys . Res. Commun. 179:190-196(1991) 
[3] 

SEQUENCE OF 134-170. 
MEDLINE=89 034127; PubMed=3 1 82 8 00 ; 

Yonezawa S., Takahashi T., Wang X., Wong R.N.S., Hartsuck J A 
Tang J. ; ' 

"Structures at the proteolytic processing region of cathepsin D " • 
J. Biol. Chem. 2 63:16504-16511(198 8). * ' 

FUNCTION: Acid protease active in intracellular protein breakdown 
CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
that of pepsin A. Does not cleave the 4-Gln- | -His-5 bond in B 
chain of insulin . 

SUBUNIT: Occurs as a mixture of both a single chain form and two 
types of two chain (light and heavy) forms. 
SUBCELLULAR LOCATION: Lysosomal. 
SIMILARITY: Belongs to peptidase family Al . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch ) . 

EMBL; X54467; CAA3834 9.1; 
PIR; S13111; KHRTD. 
HSSP; P07339; 1LYB. 
MEROPS; A01.009; -. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Glycoprotein; Zymogen; Signal; 
Lysosome . 



— ! - 

- ( - 



- ! - 

- ! - 

- » _ 



SIGNAL 


1 


20 


POTENTIAL. 




PROPEP 


21 


64 


ACTIVATION PEPTIDE 


(POTENTIAL) . 


CHAIN 


65 


407 


CATHEPSIN D. 


CHAIN 


65 


164 


CATHEPSIN D 12 kDa 


LIGHT CHAIN. 


CHAIN 


165 


407 


CATHEPSIN D 30 kDa 


HEAVY CHAIN. 


CHAIN 


65 


117 


CATHEPSIN D 9 kDa ; 


LIGHT CHAIN. 


CHAIN 


118 


407 


CATHEPSIN D 34 kDa 


HEAVY CHAIN. 


ACT SITE 


97 


97 


BY SIMILARITY. 


ACT SITE 


290 


290 


BY SIMILARITY. 




DISULFID 


91 


160 


BY SIMILARITY. 




DISULFID 


110 


117 


BY SIMILARITY. 




DISULFID 


281 


285 


BY SIMILARITY. 




DISULFID 


324 


361 


BY SIMILARITY. 





FT CARBOHYD 134 134 N-LINKED ( GLCNAC . . .) ( POTENTIAL) 

FT CARBOHYD 258 258 N-LINKED (GLCNAC. . .) (POTENTIAL) 

FT CONFLICT 15 15 D -> A (IN REF . 2). 

FT CONFLICT 163 163 D -> T (IN REF 3) 

FT CONFLICT 205 205 K -> N (IN REF. 2)\ 

FT CONFLICT 262 262 K -> N (IN REF. 2). 

SQ SEQUENCE 407 AA; 44680 MW; C42 3AD4 1 04D95F84 CRC64; 

Query Match 14.4%; Score 305; DB 1; Length 4 07; 

Best Local Similarity 29.4%; Pred. No. 6.5e-16; 

Hatches 104; Conservative 61; Mismatches 143; Indels 46; Gaps 15; 

Q y 30 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 83 

111:111111 :: | | | | | | | : | . . . . . , , , , , 

Db 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVliHKYNSDKSSTYVKNGTSF 138 

Q ^ 84 TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGI KWNGILGLA 138 

s I I I : I : : : I | : : | : : | | | | | : I : : | | | | : 

Db 139 DIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIF-GEATKQPGVVFIAAKFDGILGMG 197 

Q ^ 139 YATLAXPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLY 197 

I : : : : I I : I : I : I : I I | : | | | | : | | | : | 

Db 198 YPFIS—VNKVLPVFDNLMKQKLVEKNIFSFY LNRDPTGQPGGELMLGGTDSRYY 250 

Q y 198 KGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNAI)KAI VT5SGTTLLRLPQKVFDAV 257 

1 : : 1 : s I s I : : : I I : | : | | | : : I I I I : I I : I I I | | 

Db 251 HGELSYLNVTRKAYWQVHMDQLEVGSE-LTL-CK-- GGCEAIVDTGTSLLVGP VDEV 303 

QY 258 VEAVARASLIPEFSDGFWTGSQLACWTNSETPWS YFPKISI YLRDENSSRSFRITILPQL 317 

1 : 1 : : I I I I : I : I : | : 

Db 304 KELQKAIGAVPLIQGEY MIPC EKVSSLPI ITFKLGGQN YELHPEK 34 8 

Q y 318 YIQPMMGAGLNYECYRF GISPSTNAL-VIGATA/MEGFYVIFDRAQKRVGFA 367 

II s II I II: I : : | : : | : | | | , | I ! t 

Db 349 YILKVSQAGKTICLSGFMGMDIPPPSGPLWILGDVFIGCYYTVFDREYNRVGFA 402 



DT 
DT 



RESULT 14 
RENI_MOUSE 

ID RENI_MOUSE STANDARD ; PRT; 402 AA. 

AC P06281; P97911; Q62153; Q62154; 
DT 01-JAN-1988 (Rel. 06, Created) 

01-JAN-1988 (Rel. 06, Last sequence update) 
15-MAR-2004 (Rel. 43, Last annotation update) 
DE Renin 1 precursor (EC 3.4.23.15) (Angiotensinogenase ) (Kidney renin) 
GN REN1 OR REN-1 OR REN. 
OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=BALB/ c ; 

RX MEDLINE=84 182 525; PubMed=637 068 6; 
RA Holm I., Olio R., Panthier J. -J., Rougeon F. ; 

"Evolution of aspartyl proteases by gene duplication: the mouse renin 
gene is organized in two homologous clusters of four exons."; 



RT 
RT 



RT 
RT 



RA 
RT 
RT 



RL EMBO J. 3:557-562(1984). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; TlSSUE=Kidney ; 

RX MEDLINE=90067953; PubMed=2 68 5761 ; 

RA KimW.S., Murakami K. , Nakayama K. ; 

RT "Nucleotide sequence of a cDNA coding for mouse Renl preprorenin »* - 

RL Nucleic Acids Res. 17:94 80-94 80(198 9) * ' 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=DBA/ 2 , and C57BL/10; 

RX MEDLINE=90108722; PubMed=2 69l33 9 ; 

RA Burt D.W., Mullins L.J., George H., Smith G., Brooks J., Pio li D 

RA Brammar W.J.; 

"The nucleotide sequence of a mouse renin-encoding gene, Ren-ld and 
its upstream region. ' 
RL Gene 84:91-104(1989). 
RN [4] 

RP SEQUENCE OF 1-30 FROM N.A. 
RC TISSUE=Kidney; 

RX MEDLINE=84298161; PubMed=608 92 05 ; 

Panthier J . - J . , Dreyfus M. , Roux D.T.L., Rougeon F. ; 

"Mouse kidney and submaxillary gland renin genes differ in their 5' 
putative regulatory sequences."; 
RL Proc. Natl. Acad. Sci. U.S.A. 81:54 89-5493(1984) 
RN [5] 

RP SEQUENCE OF 1-31 FROM N.A. 
RX MEDLINE=85085936; PubMed=6392 850 ; 

RA Field L.J., Philbrick W.M., Howies P.N., Dickinson D P 
RA McGowan R.A., Gross K.W.; 

"Expression of tissue-specific Ren-1 and Ren-2 genes of mice- 
comparative analysis of 5 1 -proximal flanking regions "; 
RL Mol. Cell. Biol. 4:2321-2331(1984). 
RN [6] 

RP SEQUENCE OF 22-37 AND 72-80. 

RC STRAIN=C57BL/10ROS X C3H/HEROS; TISSUE=Kidney ; 
RX MEDLINE-97182599; PubMed=903 0738 ; 

RA Jones C.A., Petrovic N. , Novak E.K., Swank R.T., Sigmund C D 
RA Gross K.W.; 

RT "Biosynthesis of renin in mouse kidney tumor As 4.1 cells."; 
RL Eur. J. Biochem. 243:181-190(1997). 

CC -!- FUNCTION: Renin is a highly specific endopeptidase, whose only 
CC known function is to generate angiotensin I from angiotensinogen 

ZZ x ? the P lasma ' initiating a cascade of reactions that produce an 

cc elevation of blood pressure and increased sodium retention by the 

CC kidney. J 

CC -!- CATALYTIC ACTIVITY: Cleaves Leu- | - bond in angiotensinogen to 

^ generate angiotensin I. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Kidney. 

CC -!- INDUCTION: Renal renin is synthesized by the juxtaglomerular cells 
CC of the kidney m response to decreased blood pressure and sodium 

CC concentration. 

CC -!- POLYMORPHISM: In inbred mouse strains, there are at least two 
CC alleles which can occur at the Renl locus: Ren-ID and Ren-lC. 

cc The sequence shown is that of Ren-lC. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 



RT 
RT 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is 
modified and this statement is not removed. Usage by and fo 
entities requires a license agreement (See http : //www. isb-sib . 
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X00851; 


CAA25391 . 
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JOINED. 


DR 


EMBL; 


X16642; 


CAA34636. 


1; 




DR 


EMBL; 


K02596; 


AAA40045 . 


1; 




DR 


EMBL; 


M32352; 


AAA40043 . 


1; 




DR 


EMBL; 


K02800; 


AAA40044 . 


1; 




DR 


EMBL; 


M34190; 


AAA40042 . 


1; 




DR 


PIR; A00989; 


REMSK. 






DR 


HSSP; 


P00796; 


1SMR. 






DR 


MEROPS; A01.007; -. 







MGD; MGI: 97898; Renl . 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE ; 2. 

Hydrolase; Aspartyl protease; Plasma; Glycoprotein; Zymogen; 
Signal . 



SIGNAL 

PROPEP 

CHAIN 

ACT_SITE 

ACT_SITE 

DISULFID 

DISULFID 

CARBOHYD 

CARBOHYD 

CARBOHYD 

VARIANT 

VARIANT 

VARIANT 

VARIANT 

VARIANT 

CONFLICT 

CONFLICT 

CONFLICT 

SEQUENCE 



1 
22 
72 
102 
287 
115 
278 
69 
139 
320 
58 
68 
160 
315 
352 
6 
24 
163 
402 



AA; 



21 

71 
4 02 
102 
287 
122 
282 

69 
139 
320 

58 

68 
160 
315 
352 

23 

24 
163 

44342 



ACTIVATION PEPTIDE 

RENIN 1. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

BY SIMILARITY. 

N— LINKED (GLCNAC. . 

N— LINKED (GLCNAC. , 

N— LINKED (GLCNAC. . 



MW; 



W -> R (in Ren-ID) . 
T -> I (in Ren-ID) . 
S -> V (in Ren-ID) . 
E -> D (in Ren-ID) . 
N -> Y (in Ren-ID) . 
MISSING (IN REF. 1) . 
T -> I (IN REF. 1) . 
V -> VSRV (IN REF. 1) . 
D42920B555E97A38 CRC64 ; 



) (POTENTIAL) 
) (POTENTIAL) 
) (POTENTIAL) 



Query Match 14.4%; Score 304; DB 1; Length 402; 

Best Local Similarity 29.0%; Pred. No. 7.6e-16; 



Matches 108; Conservative 60; Mismatches 152; Indels 52; Gaps 17; 

Qy 6 LASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSY 63 

I I I I : I II I I I : I I I M I : : : i | | | : | | | | 

70 LTSPWLTNYL NTQ YYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLY 119 



Db 



64 IDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI F 119 

I : : : : I I = I I I I : I I I I : : I I I : | : | 

Db 120 LACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTV-GGITVTQTFGEVTEL 178 

Qy 120 ES ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVA 17 8 

II I : : | : | | : : | : : | | : : : | : | | | : 

Db 179 PLIPFML AKFDGVLGMGFP — AQAVGGVTPVFDHILSQGVLKEEVFSVYY NR 228 

Q y 179 GSGTNGGSLVLGGIEPSLYKGD1WYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 238 

II I I : I I I I : I 1:1: I I : : | | : : : | | | | | | 

Db 229 GSHLLGGEWLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVG — SSTLLCEEGCA V 284 



Q y 239 IVDS GTTLLRLPQKVFDAWEAV-ARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKIS 297 

: I I : I : : : I : : : I : | : | |: : | | : ) , M 

285 WDTGSSFISAPTSSLKLIMQALGAKEKRIEEY WNC SQVP TLPDIS 331 



Db 



Qy 29 8 IYLRDENSSRSFRITILPQLYIQPMMGAGL-NYECYRFGISPSTNAL-VIGATVMEGFYV 355 

I I : : : : = I I : I I I : I : | | | : M 

Db 332 FDL GGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVWVLGATFIRKFYT 387 

QY 35 6 I FDRAQKRVGFA 3 67 

Ml I : I I I 
Db 38 8 EFDRHNNRIGFA 399 



RESULT 15 
PEPl_GADMO 

ID PEPl_GADMO STANDARD; PRT; 324 AA. 

AC P56272; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Pepsin IIB (EC 3.4.23.-). 

OS Gadus morhua (Atlantic cod) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleos tei ; 
OC Acanthomorpha; Paracanthopterygii ; Gadiformes; Gadidae; Gadus 
OX NCBI_TaxID=8 04 9; 
RN [1] 

RP SEQUENCE, AND X-RAY CRYSTALLOGRAPHY. 
RC TISSUE-Stomach; 

Karlsen S., Hough E., Olsen R.L.; 

"Structure and proposed amino-acid sequence of a pepsin from Atlantic 
RT cod (Gadus morhua)."; 

RL Acta Crystallogr. D 54:32-4 6(1998). 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

DR PDB; 1AM5; 24-DEC-97. 

InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR009007; Pept_A_acid. 
InterPro; IPR001461; Peptidase_Al . 
DR Pfam; PF00026; asp; 1. 



RA 
RT 



DR 
DR 
DR 



DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 



PRINTS; PR00792 
PROSITE; PS0014 
Hydrolase; Aspa 



ACT_SITE 32 32 

ACT_SITE 214 214 

DISULFID 45 50 

DISULFID 206 209 

DISULFID 247 280 

STRAND 2 9 

TURN 10 12 

STRAND 13 20 

TURN 21 24 

STRAND 25 32 

TURN 33 34 

STRAND 38 40 

STRAND 42 42 

TURN 4 3 4 4 

HELIX 48 51 

TURN 52 52 

STRAND 56 56 

HELIX 58 60 

TURN 62 63 

STRAND 65 74 

STRAND 79 90 

STRAND 96 106 

TURN 110 114 

STRAND 119 122 

HELIX 126 128 

HELIX 130 132 

HELIX 136 142 

TURN 143 144 

STRAND 150 154 

TURN 158 159 

STRAND 163 167 

HELIX 172 174 

STRAND 175 175 

STRAND 180 187 

TURN 188 189 

STRAND 190 194 

STRAND 196 199 

TURN 200 201 

STRAND 202 203 

STRAND 209 213 

TURN 215 216 

STRAND 220 222 

TURN 224 226 

HELIX 227 234 

TURN 235 235 

STRAND 237 238 

STRAND 243 244 

TURN 247 248 

STRAND 256 260 

TURN 261 262 

STRAND 263 267 

HELIX 269 272 

STRAND 273 275 

STRAND 280 282 



; PEPSIN. 

1; ASP_PROTEASE; 2. 

rtyl protease; Digestion; 3D-structure 



BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
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FT 


HELIX 


301 


306 


FT 


STRAND 


307 


312 


FT 


TURN 


313 


316 


FT 


STRAND 


317 


324 


SQ 


SEQUENCE 


324 AA; 


34 



324 AA; 34014 MW; EE3A6097B694 1DD7 CRC64 ; 

Query Match 14.3%; Score 302; DB 1; Length 324; 

Best Local Similarity 25.6%; Pred. No. 8.1e-16; 

Matches 94; Conservative 75; Mismatches 134; Indels 64; Gaps 15; 

Qy 18 MVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNF AVAGTPHSYIDTYFD 69 

:::::: I i : I I I I I : : : : I I I I M I I : I : I 
Db 2 VTEQMKNEADTEYYGVI S I GTP PES FKVI FDTGS SNLWVS S SHCSAQACSNHN KFK 57 

Qy 70 TERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FESENF FL 126 

: I I I I I I : I I | : | : | | : : | : : : | | : | 

Db 58 PRQSSTYVETGKTVDLTYGTGGMRGILGQDTVSVGGGSDPN QELGESQTEPGPFQ 112 

Qy 127 PGI KWNGI LGLAYATLAKPS S S LETFFDS LVTQANI - PNVFSMQMCGAGLPVAGSGTNGG 185 

: : I I I I I I I : : I : : | | : : : | : : : : | | : : I I I I 

Db 113 AAAPFDGILGLAYPSIA — AAGAVPVFDNMGSQSLVEKDLFSFYL SGGGANGS 163 

Qy 186 SLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLD-CREYNADKAIVDSGT 244 

: : I I i : : I I I I : I : I I : I : : : : I I : : I : I I I I : I I 

Db 164 EVMLGGVDNSHYTGSIHWIPVTAEKYWQVALDGITVNGQTAACEGC QAIVDTGT 217 

Qy 245 TLLRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISI YLRDEN 304 

: : I : : : : : : | | : | ; | | 
Db 218 SKIVAPVSALANIMKDIGASE NQGEMMGN CASVQSLPDITF 258 

Qy 305 SSRSFRITILPQLYI QPMMGAGLN YECYRFGI S PSTNAL-VI GATVMEGFYVI FDRA 360 

: : : I II I : I I I : : I : I : I : : i i : I I 

Db 259 TINGVKQPLPPSAYIEGDQAFCTSGLGSS GVPSNTSELWI FGDVFLRNYYTI YDRT 314 

Qy 361 QKRVGFA 367 

: I I I I 

Db 315 NNKVGFA 321 



Search completed: March 4, 2004, 15:36:25 
Job time : 13.9574 sees 



